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104798 
134082 
119180 
107741 



AA010210 



121980 
100898 
121626 
133670 
131879 
100254 
133194 
106081 
115544 
119955 
104407 
135019 
114815 
119471 
117788 
119406 
130777 
130494 
104107 
121483 
104451 
118027 
109419 
115783 
110585 
123165 
103966 



104078 
117624 
112421 



R80413 

AA016982 

AA335223 

R22035 

AA338729 



T88798 

AF009674 

AA191495 

S72043 

J04599 

K02765 

N36001 

T90303 

L19871 

AA431320 



W80464 

W73926 

R00841 

R44155 

AA429B86 

H64638-HT5050 

AM16974 

AA243416 

AA017161 

D38037 

AA291726 

M418394 

AA351433 

W87460 

H61361 



M161488 



T95064 

R61742 

L13197 

AA424111 

AA411981 

M13299 

N52770 

AA227560 

M424487 

H62223 



Hs.47041 
Hs.17235 
Hs.79006 
Hs.92520 
Hs.64341 
Hs.75558 
Hs.23331 
Hs.133096 
Hs.90973 



Hs.380 
Hs.103779 
Hs.79706 
Hs.84183 



Hs.4947 
Hs.172069 
Hs.1 12218 
Hs.1 10407 



Hs.75470 
Hs.33792 
Hs.77643 
Hs.67201 
Hs.25354 
Hs.66187 
Hs.58989 
Hs.102171 
Hs.98428 
Hs.103931 
Hs.55445 
Hs.46849 
Hs.1 93771 
Hs.256554 
Hs.75874 
Hs.1 2598 
Hs.25274 
Hs.102119 



AA497026 
W92811 
AA431456 
AA491285 



Hs.1 05216 
Hs.127270 
Hs.21192 
Hs.22313 
Hs.1 18926 
Hs.222010 
Hs.82364 
Hs.23127 
Hs.22059 
Hs.1 83927 
Hs.98736 
Hs.1 75144 



ESTs 

pepsinogen 5; group I (pepsinogen A) 

ESTs 

ESTs 

ESTs 

Major Histocompatibility Complex, Class li, Dr Beta 2 (Gb:X65561) 
plasminogen activator inhibitor; type ! 



metallothionein 3 (growth inhibitory factor (neurotrophic)) 
biglycan 
complement component 3 

ESTs; Wkly smlr to !! ALU SUBFAMILY SQ WARNING ENTRY !! [H.sapiens] 

ESTs; Wkly smlr to leucine-rich gliorna-inactivated prot precursor [H.sapiens] 

activating transcription factor 3 

ESTs 

EST 

solute carrier family 1 (glial high affinity glutamate transporter); member 2 

ESTs; Weakly similar to envelope protein [H.sapiens] 

plectin 1 ; intermediate filament binding protein; SOOkD 

diptheria toxin resistance protein required for diphthamide 

biosynthesis (Saccharomyces)-llke 1 

ESTs; Moderately similar to Pro-a2(XI) [H.sapiens] 

ESTs; Wkly smlr to alternatively spliced product using exon 13A [H.sapiens] 

ESTs 

DKFZP434C212 protein 
ESTs 

ESTs; Weakly similar to coded for by C. elegans cDNA yk173c12.5 [C.elegans] 

Spliceosomal Protein Sap 49 

ESTs 

hypothetical protein; expressed in osteoblast 
ESTs 

FK506-binding protein IB (12.6 kD) 



Homo sapiens clone 23700 mRNA sequence 
ESTs 

immunoglobulin superfamily containing leucine-rich repeat 
Human Hox22 gene for a homeobox protein 
DKFZP434B0335 protein 



EST 
ESTs 

pregnancy-associated plasma protein A 

T-cell lymphoma invasion and metastasis 2 

ESTs; Modly smlr to putative seven pass transmembrane prot [H.sapiens] 

blue cone pigment 

thymosin; beta 4; X chromosome 

receptor-interacting serine-threonine kinase 3 

ESTs; Weakly similar to LIV-1 protein [H.sapiens] 

ESTs; Wkly smlr to !!!ALU SUBFAMILY SB1 WARNING ENTRY !!![H.sapiensl 

ESTs; Weakly smlr to HALU SUBFAMILY J WARNING ENTRY !! [H.sapiens] 

ESTs 

Homo sapiens clone 25155 mRNA sequence 
ESTs 

DKFZP586K091 9 protein 
ESTs 



ESTs; Weakly similar to I! ALU SUBFAMILY J WARNING ENTRY I! [H.sapiens] 



0.073 
0.073 
0.073 
0.073 



0.074 
0.074 
0.074 
0.074 
0.074 



0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.075 
0.Q75 
0.076 



0.076 
0.076 
0.076 
0.076 
0.076 
0.076 
0.076 
0.076 
0.076 
0.077 
0.077 
0.077 
0.077 
0.077 
0.077 
0.077 
0.077 
0.077 
0.077 
0.077 
0.077 
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131833 
119219 
135415 



124072 
126151 
119035 
110157 
128515 



25 134714 
129905 
120421 
100385 
102789 

30 120139 
135238 
129518 
132960 
108751 

35 134060 
111338 
112345 
126456 

40 128937 
103485 
111202 



129156 
121171 
103864 
128591 
122172 
112802 
107723 
113011 
131279 
103190 



AA460257 

J03909 

AI095718 



AA232646 

H96670 

AA609614 



R01779 
H18987 
M149044 



T86796 

AA236166 

HG4490-HT4876 

U86759 

Z39273 

U76343 

N54845 

AA609742 

M127063 

D42039 

N79778 



Y08409 

N68280 

AA429890 

X98085 

U65581 

U70867 

R01127 

Ml 6938 



55 108785 M128999 



AA028195 

AA400008 

AA207264 

AA255537 

AA435753 

R97647 

AA015967 

T23737 



Hs.206501 

Hs.15866 

Hs.14623 

Hs.135015 

Hs.119273 

Hs.125139 

Hs.32454 

Hs.32973 

Hs.1 10783 

Hs.99967 

Hs.68061 

Hs.42221 

Hs.5241 

Hs.251613 

Hs.101637 

Hs.40808 

Hs.7740 

Hs.169731 

Hs.10086 

Hs.6430 

Hs.24865 

Hs.71848 

Hs.890 

Hs.132875 

Hs.1 32957 

Hs.158336 

Hs.77876 

Hs.96970 

Hs.1 73030 

Hs.6150 

Hs.203717 

Hs.78871 

Hs.35094 



HS.10726 

Hs248415 

Hs.107922 

Hs.166066 

Hs.54433 

Hs.159191 

Hs.83974 

Hs.1 91 04 

Hs.820 

Hs.12725 

Hs.75774 

Hs.29189 

Hs.172856 

Hs.81001 

Hs.97058 



Hs.108973 

Hs.161814 

Hs.181077 

Hs.102057 

HS.161B54 

Hs. 174855 

Hs.60680 

Hs.1600 

Hs.25197 

Hs.58414 



le (neutral sphingomyelinase) 0.077 

KIAA0255 gene product 0.077 

Homo sapiens clone 643 unknown mRNA; complete sequence 0.078 

ESTs 0.078 

interferon; gamma-inducible protein 30 0.078 

ESTs 0.078 

KIAA0296 gene product 0.078 

ESTs 0.078 

ESTs; Moderately similar to KIAA0544 protein [H.sapiens] 0.078 

glycine receptor; beta 0.078 

ESTs 0.078 

even-skipped homeo box 1 (homolog of Drosophila) 0.078 

ESTs; Weakly similar to sphingosine kinase [M.musculus] 0.078 

ESTs 0.078 

ESTs 0.078 

EST 0.078 

EST; Weakly similar to hypothetical protein [H.sapiens] 0.078 

ESTs 0.078 

ESTs 0.078 

ESTs 0.078 

ESTs; Highly similar to HYPOTHETICAL PROTEIN Kl AA0195 [H.sapiens] 0.078 

protein with polyglutamlne repeat 0.078 

ESTs ~ 0.078 

Human clone 23548 mRNA sequence 0.078 
lymphotoxin beta (TNF superfamily; member 3) 
ESTs; Weakly similar to predicted using Genefinder [C.elegans] 
ESTs; Weakly similar to chondromodulin-l precursor [H.sapieris] 
Proline-Rich Protein Prb4, Allele 
netrin 2 (chicken)-like 

Human DNA from chromosome 19-specific cosmid R30923; genomic sequence 
Human liver GABA transport protein mRNA; 3' end 



ESTs 
KIAA0521 protein 



za56d02.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 

IMAGEa96547 5', mRNA sequence. 

ESTs 

thyroid hormone responsive SP0T14 (rat) homolog 



tenascin R (restrictin; janusin) 
ribosomal protein L3-Iike 

solute carrier family 21 (prostaglandin transporter); member2 



0.079 
0.079 
0.079 
0.079 
0.079 
0.079 
0.079 
0.079 
0.079 
0.079 
0.079 
0,079 



0.079 
0.079 
0.079 
0.079 
0.079 
0.079 



KIAA1021 protein 
ESTs 

ESTs; Weakly similar to DY3.6 [C.elegans] 

EST; Highly similar to CMP-N-acetylneuraminic acid hydroxylase [H.sapiens] 

zo8f12.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens 

cDNA clone IMAGE:567119 3', mRNA sequence 

KIAA1066 protein; JSAP1 homolog (mouse); JIP3 homolog (mouse) 

ESTs 

Interteukin 8 

dolichyl-phosphate mannosyltransferase polypeptide 2; regulatory subunit 
ESTs 

ESTs; Weakly similar to Miller-Dieker lissencephaly gene [H.sapiens] 
ESTs; Weakly similar to O-linked GlcNAc transferase [H.sapiens] 



chaperonin containing TCP1; subunit 5 (epsilon) 
STIP1 homology and U-Box containing protein 1 
filamin C; gamma (actin-binding protein-280) 



0.081 
0.031 
0.081 
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(cat#936206) Homo sapiens cDNA 



101645 
130536 
107732 
123071 
113537 
101250 
122521 
133914 
102038 
110336 
118637 
117966 



112776 
105494 
117000 



123776 
114454 
101246 



H43286 

HG3945-HT4215 

M1 15240 

AA234259 

M77349 

M59807 

T17045 

AA016181 

AA482593 

T90457 



N58054 
R45977 
U48437 



AA429452 

M80647 

AA458674 



AA610071 

AA021091 

L33799 

U78310 

T89601 

AA1 12209 

AA278767 

M490878 

T23724 

AA431847 

R17939 

U38268 

R98774 



Hs.167017 

Hs.61816 

Hs.99816 

Hs.1 18787 

Hs.943 

Hs.159492 

Hs.59752 

Hs.104285 

Hs.191293 

Hs.79133 

Hs.1 49227 

Hs.77542 

Hs.477 

Hs.174094 

Hs.49822 

Hs.94012 

Hs.182320 

Hs.1 25078 

Hs.1 01 174 

Hs.57734 

Hs.3781 

Hs.1 2807 

Hs.28774 



Hs.74565 

Hs.1 34296 

Hs.45203 

Hs.204144 

Hs.34494 

Hs.29288 

Hs.1 12236 

Hs.133151 

Hs.1 074 

Hs.39960 



Hs.43509 
Hs.47413 
Hs.2679 

Hs.88599 
Hs.206704 
Hs.9315 

Hs.1 1281 3 
Hs.226208 
Hs.202097 
Hs.13501 
Hs.95497 

Hs.1209 
Hs.914 
Hs.1 11 334 



ESTs 

EST01 884 Fetal brain, 
clone HFBCH10, mRNAseqi 
gamma-aminobutyric add (GABA) B receptor; 1 
Phospholipid Transfer Protein 
ESTs 
ESTs 

transforming growth factor; beta-induced; 68kD 
natural killer cell transcript 4 
spastic ataxia of Charlevoix-Saguenay (sacsin) 
ESTs 
ESTs 
ESTs 
cadherin8 

ESTs; Weakly similar to PROLINE-RICH PROTEIN MP-3 [M.musculus] 
ESTs 

hydroxysteroid (17-beta) dehydrogenase 3 

ESTs; Weakly similar to !! ALU SUBFAMILY J WARNING ENTRY !! [H.sapiens] 
ESTs 

ESTs 0.082 

ESTs; Weakly similar to Mouse i9JS mRNA; complete cds [M.musculus] 0.082 

Human mRNA for ornithine decarboxylase antizyme; ORF 1 and ORF 2 0.082 

microtubute-associated protein tau 0.082 

KIAA01 48 gene product 0.082 

Homo sapiens BAC clone RG1 18D07 from 7q31 0.082 

ESTs 0.082 

ESTs . 0.082 

ESTs 0.082 

ribosomal protein L18a 0.082 

amyloid beta (A4) precursor-like protein 1 0.082 

ESTs 0.032 

ESTs 0.082 
ESTs; Modly smlr to tumor necrosis factor-alpha-induced prot B12 [H^apiens] 0.082 

ESTs 0.082 

Homo sapiens mRNA; cDNA DKFZp434P174 (from clone DKFZp434P174) 0.082 

ESTs; Weakly similar to repressor protein [H.sapiens] 0.082 

transient receptor potential channel 7 0.082 

surfactant; pulmonary-associated protein C 0.083 

ESTs 0.083 

Human complement C1 q B-chain gene, exon A+1 0.083 

ESTs 0.083 

thromboxane A synthase 1 (platelet; cytochrome P450; subfamily V) 0.083 

EST 0.083 
zm2d1 ,s1 Stratagene corneal stroma (8937222) Homo sapiens cDN A clone 
IMAGE.-512947 3' similar to TR:E198281 E198281 THIOREDOXIN 

REDUCTASE ;contains Alu repetitive element;, mRNA sequence 0.083 

ataxin 2 related protein 0.083 

ESTs 0.083 
gap Junction protein; beta 1; 32kD (connexin 32; Charcot-Marie-Tooth 

neuropathy; X-linked) 0.083 

ESTs 0.083 

ESTs - 0.083 
ESTs; Weakly similar to N EURONAL OLFACTOMEDIN-RELATED 

ER LOCALIZED PROTEIN [H.sapiens] 0.083 

ESTs 0.083 

ESTs 0.083 

procollagen C-endopeptidase enhancer 0.083 

pescadillo (zebrafish) homolog 1 ; containing B RCT domain 0.083 
ESTs; Weakly similar to GLUCOSE TRANSPORTER TYPE 5; 

SMALL INTESTINE (H.sapiens] 0.083 

acyl-Coenzyme A dehydrogenase; long chain 0.083 

Human mRNA for SB classll histocompatibility antigen alpha-chain 0.083 

ferritin; light polypeptide 0.083 



ESTs; Highly similar to CGI- 146 protein [Rsapiens] 
ESTs 

Human cytochrome b pseudogene, partial cds 



0.083 
0.083 
0.084 
0.084 
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T67285 
N49947 
T98484 
C00424 



Hs.34024 
Hs.14541 



Hs.18377 
Hs.7101 
Hs.124838 
Hs.8265 



Human clone W2-6 mRNA from chromosome X 
ajlfin 1 

zm89e5.s1 Stratagene ovarian cancer (#93721 9) H sapiens cDNA clone 

IMAGE:54512 3' similar to gb:X14723 CLUSTERIN PRECURSOR 

(HUMAN);, mRNA sequence 

ESTs 

ESTs 

EST 

periodontal ligament fibroblast protein 
ESTs 

transglutaminase 2 (C polypeptide; protein-glutamine 



0.084 
0.084 
0.084 
0.084 
0.084 
0.084 



109284 
25 116689 
100545 
102634 
111735 
105181 
30 122681 
114543 
133597 
121064 



45 128514 
123099 
134067 



50 114395 
107465 
101983 
112544 
111423 

55 127918 
107300 
134947 



124579 
130471 



Hs.103535 

Hs.19813 

Hs.727 

Hs.1 10407 

Hs.86092 

Hs.66099 

Hs.77667 



Hs.158419 
Hs.75139 
Hs.97406 
Hs.1 97728 
Hs.95659 



Hs.183669 

Hs.96963 

Hs.35124 

Hs.101661 

Hs.75511 

Hs.100843 

Hs.79 

Hs.78920 

Hs.40124 

Hs.89563 

Hs.110155 

Hs.251385 

Hs.75323 

Hs.29153 

Hs.188507 

Hs.1 15396 

Hs.90488 



Hs.146428 
Hs.157441 
Hs.96417 
Hs.20432 



myosin; heavy polypeptide 8; skeletal muscle; perinatal 

ESTs; Weakly similar to ION CHANNEL HOMOLOG RIC 

PRECURSOR [M.musculus] 

ESTs 

ESTs 

inhibin; beta A (activin A; activin AB alpha polypeptide} 

ESTs; Weakly similar to coded for by C. elegansfcDNA yk173c12.5 [C.elegans] 

ESTs 

ESTs 

Mucin 3, Intestinal (Gb:M55405) 

lymphocyte antigen 6 complex; locus E 

ESTs; Weakly similar to FAST kinase [H.sapiens] 

ESTs; Moderately similar to unknown [R.norvegicus] 



partner of RAC1 (arfaptin 2) 
ESTs 

ESTs; Weakly similar to ZINC FINGER PROTEIN 135 [H.sapiens] 

lethal giant larvae (Drosophila) homolog 1 

gastrin-releasing peptide receptor 

ESTs 

ESTs 

GTP binding protein 1 

ESTs; Weakly similar to !! ALU SUBFAMILY J WARNING ENTRY !! [H.sapiens] 

EST 

ESTs 

ESTs 

connective tissue growth factor 

ESTs; Weakly similar to sMar to GTP-binding protein [Celegans] 



0.085 
0.085 
0.085 
0.085 



0.035 
0.085 
0.085 
0.085 



Rab geranylgeranyltransferase; alpha subunit 
EST 

nuclear cap binding protein 1 ; 80kD 
ESTs 

murine retrovirus integration site 1 homolog 

prohibiSn 

ESTs 

ESTs 

Human germline IgD chain gene; C-region; C-delta-1 domain 
ESTs 

yj71 a08.r1 Soares breast 2NbHBst Homo sapiens cDNA clone IMAGE:1 54166 

5' similar to gb:L1 1284 DUAL SPECIFICITY MITOGEN-ACTIVATED PROTEIN 

KINASE KINASE 1 (HUMAN);, mRNA sequence. 

ESTs; Weakly similar to TERATOCARCINOMA-DERIVED GROWTH 

FACTOR 1 [H.sapiens] 

adducin 1 (alpha) 

ESTs 

ESTs; Weakly similar to ZFOC1 gene product [H.sapiens] 

Human nicotinamide N-methyltransferase gene, exon 1 and 5' flanking region 

collagen; type V; alpha 1 

spleen focus forming virus (SFFV) proviral integration oncogene spil 

ESTs 

ESTs 
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0.085 
0.086 
0.086 
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Hs.125416 
Hs.1 85182 
Hs.171900 



116099 
100721 
112569 
130645 



108844 
129874 
105139 
124789 
115923 
123640 
131607 



104642 
131752 
114727 



106218 
111562 
121219 
101187 
101513 
116454 
116171 
117500 
119978 
132005 
109914 



130404 
114072 
131470 
124573 
114717 
133806 
130470 



W73859 
AA227941 
T15965 

HG3636-HT3846 

T56800 

W23761 

AA45S309 

HG3355-HT3532 

R73150 

AA020942 

HG3527-HT3721 

M27161 



U66061 
H04992 
Z20777 



D84361 

AA428451 

R09567 

AA400606 

L20316 

M28210 

AA621071 



AF009801 

AM17181 

AA447545 

AA365356 

X72012 

Z38184 

X54938 



Hs.1 55829 
Hs.78601 
Hs.241395 
Hs.30499 
Hs,9857 

Hs.6845 



Hs.97340 
Hs.177961 
Hs.181551 
Hs.1 10082 
Hs.78110 
Hs.38205 
Hs.1 12681 
Hs.1 72740 
Hs.1 81 125 
Hs.71055 
Hs.1 08211 
Hs.143513 
Hs.1 84245 
Hs.31566 



Hs.179715 

Hs.1 51 123 

Hs.91146 

Hs.1 87569 

Hs.144344 

Hs.208 

Hs.27744 

Hs.42034 

Hs.42658 

Hs.44278 

Hs.59190 

Hs.1 73091 

Hs.1 94704 

Hs.155140 

Hs.1 05941 

Hs.1 20858 

Hs.1 8268 

Hs.97041 

Hs.76753 

Hs.1 23333 

Hs.2722 

Hs.1 94703 

Hs.252014 

HS.7B325 

Hs.1 5711 

Hs.240135 

Hs.43866 



ESTs 

ESTs; Highly similar to 40S RIBOSOMAL PROTEIN S10 [H.sapiensl 

armadillo repeat gene deletes in velocardiofacial syndrome 

CCAAT/enhancer binding protein (C/EBP); epsilon 

transcription factor 21 

ESTs 

ESTs 

Myosin, Heavy Polypeptide 9, Non-Muscle 

Homo sapiens mRNA; cDNA DKFZp564B176 (from clone DKFZp564B176) 
ESTs 

regulator of Fas-induced apoptosis 

Peroxisome Proliferator Activated Receptor (GbZ30972) 

GTP-binding protein homologous to Saccharomyces cerevisiae SEC4 

STAM-like protein containing SH3 and ITAM domains 2 

Luteinizing Hormone, Beta Subunrt 

CD8 antigen; alpha polypeptide (p32) 

adenomatous polyposis coll like 

small proline-rich protein 2A 

KIAA0676 protein 

uropoiphyrinogen decarboxylase 

protease; serine; 1 (trypsin 1) 

ESTs 

ESTs; Weakly similar to peroxisomal short-chant alcohol 

dehydrogenase [H.sapiens] 

ESTs 

zn12c5.s1 Stratagene hNT neuron (#937233) H sapiens cDNA clone 
IMAGE:54728 3' similar to TR:G1151228 G1151228 LPG1P. ;, mRNAseq 
EST 

Human Chromosome 16 BAC clone CIT987SK-A-388D4 

ESTs 

ESTs 

ESTs; Weakly similar to F17A9.2 [Celegans] 

ESTs 

ESTs 

microtubule-associated protein; RP/EB family; member 3 



ESTs 

phosphoprotein regulated by mitogenic pathways 

KIAA0929 protein Msx2 interacting nuclear target (MINT) homolog 

ESTs 

ESTs 

ESTs 

Human mRNA for p52 and p64 isoforms of N-Shc; complete cds 



ESTs 
EST 

glucagon receptor 

RAB3A; member RAS oncogene family 

ESTs; Moderately similar to T-complex protein 10A IH.sapiens] 

ESTs 

ESTs 

EST 

DKFZP434K151 protein 
leucine-rich; glioma inactivated 1 
casein kinase 2; alpha 1 polypeptide 
bagppe homeobox (Drosophila) homolog 1 
ESTs 

adenylate kinase 5 
ESTs 

endoglin (Oster-Rendu-Weber syndrome 1) 
ESTs 

inositol 1;4;5-trisphosphate3-kinase A 
adaptor-related protein complex 4; mu 1 subunit 
EST 

Human Ig J chain gens 

KIAA0639 protein 

H4 histone family; member J 

ESTs 



0.086 
0.086 
0.085 
0.086 
0.085 
0.086 



0.087 
0.087 
0.087 



0.087 
0.087 
0.087 
0.087 
0.D87 



0.087 
0.087 
0.087 
0.087 



0.088 
0.088 
0.088 



0.088 
0.088 
0.088 



0.089 
0.089 
0.089 
0.089 
0.089 
0.089 
0.089 
0.089 
0.089 
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122695 
125975 
110783 
129860 
120740 
119564 
134474 
119014 
109791 
117605 
121589 
104326 
129861 
102795 
119626 
110516 
105382 
123754 
108008 
121057 
123675 
135194 
127070 
134051 
133382 
103615 
118457 
118504 
112915 



120744 
112226 
116154 
102640 
129797 
102705 
132408 
108441 



AA495B91 

N23669 

AA410343 



W38206 
AA054746 
N95435 



N69507 

U88667 

W49499 

H55894 

AA236B53 

M609964 

AA039430 

AA398619 

AA609474 

C20975 

AA641812 

S67070 

Ml 12532 

Z46967 

N66593 

N67334 

T10176 

M470121 



AA251505 
U32S45 
AA065169 
H16076 



W38150 

HG4113-HT4383 
Ml 32509 
T57738 
T03148 
M521407 



M302772 

R50761 

M460951 

U67674 

X53595 

U77180 

M035547 

M079079 



Hs.152290 
Hs.26407 
Hs.1 29826 
Hs.96654 



Hs.44433 
Hs.191598 
Hs.1 43067 



Hs.1 98396 
Hs.184456 
Hs.37368 
Hs.1 11801 
Hs.102021 
Hs.61920 
Hs.142375 
Hs.112713 



Hs.50158 

Hs.4254 

Hs.243960 

Hs.248156 

Hs.29074 

Hs.237323 

Hs.1 15960 



Hs.77515 

Hs.7303 

Hs.1 30861 

Hs.151139 

Hs.17805 

Hs.6915 

Hs.99741 

Hs.1 12861 

Hs.76907 



Hs.103827 
Hs.174112 
Hs.4610 
Hs.1 10095 
Hs.79021 



Hs.23578 

Hs.23954 

Hs.4775 

Hs.24185 

Hs.26016 

Hs.16258 

Hs.228649 

Hs.25738 

Hs.57100 

Hs.194783 

Hs.1252 

Hs.50002 

Hs.47822 



ESTs 

ESTs; Modarately similar to undulin 2 [Rsaplens] 

ESTs; Highly similar to PACAP type-3/VIP type-2 receptor [H.sapiens] 

ESTs 

tetraspan transmembrane 4 super family 
EST 

Accession not listed in Genbank 



DRE-antagonist modulator; oalsenilin 

ESTs 

ESTs 



ATP-binding cassette; sub-family A (ABC1); member 4 0.09 

ESTs; Wkly smlr to !! ALU SUBFAMILY SX WARNING ENTRY !! [H.sapiens] 0.09 

EST 0.09 

Homo sapiens mRNA; cDNA DKFZp564H2023 (from clone DKFZp564H2023) 0.09 



ESTs; Highly similar to angiopoietin-related proflin [H.sapiens] 
ESTs 

heat shock 27kD protein 2 
ESTs 



ESTs 
ESTs 

HLA-B associated transcript-3 



N-acetylglucosamine-phosphate mutase; DKFZP434B1 87 protein 
ESTs 

ESTs; Moderately similar to CL3BC [R.norvegicus] 
inositol 1 ;4;5-triphosphate receptor; type 3 
ESTs; Weakly similar to PROHIBITS [Ksapiens] 
ESTs 

E74-lifce factor4 (ets domain transcription factor) 



ESTs; Weakly similar to BRAIN PROTEIN H5 [H.sapiens] 
ESTs 

ESTs; Highly similar to HSPC002 [H.sapiens] 
ESTs 

Accession not listed in Genbank 

Olfactory Receptor 0r17-201 

EST 

ESTs 

EST 

ESTs 

tafazzin (cardiomyopathy; dilated 3A (X-linked); endocardial 

fibroelastosis 2; Barth syndrome) 

ESTs 

ESTs 

ESTs; Weakly similar to KIM0584 protein [H.sapiens] 



ESTs; Weakly similar to RAS-RELATED PROTEIN RAB-SA [H.sapiens] 

EST 

ESTs 

ESTs 

solute carrier family 10 (sodium/bile acid cotransporter family); member 2 

apolipoprotein H (bsta-2-glycoprotein I) 

small Inducible cytokine subfamily A (Cys-Cys); member 19 

KIM0380 gene product; RhoA-spedfic guanine nucleotide exchange factor 

zm97c9.s1 Stratagene colon HT29 (#937221) Homo sapiens cDNA clone 



0.092 
0.092 
0.092 
0.092 
0.092 
0.092 
0.092 
0.092 



0.092 
0.092 
0.092 
0.092 
0.092 
0.092 
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131405 
112909 
124173 



105413 
111711 
117595 
113813 
107769 

114966 



AA054133 
AA449990 
M64358 
AA401958 



U79255 
T10069 
H41281 



IMAGE:545872 3' similar to contains element MER22 MER22 repetitive 

element ;, mRNA sequence 

ESTs 



AA043894 

AA281929 

N20535 

AA404597 

L47738 

S69265 

AA497031 



H52105 

AA2071O5 

L77564 

AA424530 

U77594 

U85773 



X14767 

N69440 

AA436198 

M907314 

M053022 

AA878398 

M58285 

X92972 

AA477561 

M196979 



Hs.125201 
Hs.40780 
Hs.2499 



Hs.159637 

Hs.6311 

Hs.7093 



Hs.154299 
Hs.112193 
Hs.220324 



Hs.143974 
Hs.43265 
Hs.7133 
Hs.80313 



Hs.8309 

Hs.45068 

Hs.103978 

Hs.21061 

Hs.37682 

Hs.154695 

Hs.195292 

Hs.93002 

Hs.1 06893 

Hs.2430 

Hs.82482 

Hs.60918 

Hs.1 12614 

Hs.56421 

Hs.42391 

Hs.64746 

Hs.9779 

Hs.1 22511 

Hs.1 50443 

Hs.89768 

Hs.38132 

Hs.103902 

Hs.188905 

Hs.8312 



Hs.80324 
Hs.6449 
Hs.104129 



Human rhom-3 gene, exon 

ESTs; Moderately similar to alternatively spliced product using 

exon13A [H.sapiens] 

ESTs; Weakly similar to B7 [M.musculus] 

ESTs 

protein kinase C-like 1 

amyloid beta (A4) precursor protein-binding; family A; member 2 (XH-like) 

ESTs 

ESTs 

ESTs 

valyl-tRNA synthetase 2 



ESTs 

Homo sapiens DNAfram chromosome 19-cosmids R30102:R29350:R27740 

containing MEF2B; genomic sequence 

ESTs; Highly similar to calcium-regulated heat stable protein 

CRHSP-24[H.sapiens] 

trophinin-assisting protein (tastin) 

ESTs 

ESTs 

coagulation factor II (thrombin) receptor-like 1 
mutS(E.coli)homolog5 

Human DNA sequence from clone 283E3 on chromosome 1p36.21-36.33. 

Contains the alternatively spliced gene for Matrix Metalloproteinase in the 

Female Reproductive tract MIFR1 ; -2; MMP21/22A; -B and -C; a novel gene; 

the alternatively spliced CDC2L2 gene for 

ESTs 

ESTs 

melastatin 1 
ESTs 

p53 inducible protein 

Homo sapiens mRNA for PLE21 protein; complete cds 
ESTs; Highly similar to CTG7a [H.sapiens] 

ESTs; Wkly smlr to glucose-6-phosphatase catalytic subunit [R.norvegicus] 
KIAA0747 protein 

Homo sapiens mRNA; cDNA DKFZp434H43 (from clone DKFZp434l143) 

serinerthreonine kinase 22B (spermiogenesls associated) 

ESTs 

retinoic acid receptor responder (tazarotene induced) 2 
phosphomannomulase 2 

ESTs; Weakly similar to RNA helicase HDB/DICE1 [H.sapiens] 

ubiquitin carrier protein E2-C 

ESTs 

transcription factor-like 1 
protein predicted by clone 23882 
EST 
EST 

ESTs; Weakly similar to Similarity to H.infiuenza ribonuclease PH [C.elegans] 
EST 

chloride intracellular channel 3 
ESTs 

centrin; EF-hand protein; 1 
KIAA0320 protein 

gamma-aminobutyric acid (GABA) A receptor; beta 1 



ESTs 
ESTs 
ESTs 

hematopoietic protein 1 

protein phosphatase 6; catalytic subunit 

ESTs 

ESTs; Weakly similar to protease [H.sapiens] 



0.093 
0.093 
0.093 



0.093 
0.093 
0.093 



0.094 
0.094 
0.094 
0.094 



0.094 
0.094 
0.094 
0.094 



0.094 
0.095 
0.095 
0.095 
0.095 



0.095 
0.095 
0.095 
0.095 
0.095 
0.095 
0.095 
0.095 
0.095 
0.095 
0.096 
0.0S6 
0.096 
0.0S6 
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AA009615 
AA521176 
- X74795 



103084 
124138 
130048 



119847 
112819 
55 131080 



127353 
132068 
105744 



M37033 
W76492 
AA453465 



AA398281 

AA521172 

AA463929 

T16814 

AM01372 

R44839 

R43754 



T95770 

AA179338 

Z40125 



H01824 

AA235988 

S75168 



U59286 
H46486 
AA49S792 



H86783 
GMCSF 

HG4458-HT4727 



Hs.257808 
HS221231 
Hs.77171 



Hs.134417 
Hs.28701 
Hs.6828 
Hs.97723 



Hs.48145 

Hs.3813 

Hs.93135 

Hs.143434 

Hs.22824 

Hs.191428 

Hs.25085 

Hs.17666 

Hs.5476 

Hs.91968 

Hs.5831 

Hs.80021 

Hs.185051 

Hs.47995 

Hs.169444 

Hs.760 



Hs.44914 
Hs.21860 
Hs.251108 
Hs.101506 
Hs.77793 
Hs.107010 
Hs.211612 
Hs.78224 
Hs.1 12589 
Hs.55092 
Hs.9853 



Hs.38481 
HS.12909 
Hs.101382 
Hs.178420 



ESTs 
ESTs 

minichromosome maintenance deficient (S. cerevisiae) 5 (cell division cycle 46) 

ESTs; Highly similar to ZINC FINGER PROTEIN 140 [H.sapiens] 

EST1 77475 Jurkat T-cells VI Homo sapiens eDNA 5' end, mRNA sequence. 

ESTs 

ESTs 

ESTs 

CD53 antigen 

ESTs 

ESTs 

KIAA0478 gene product 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 



dyrein; axonemal; light polypeptide 4 
Methylenetetrahydrofolate Reductase 
ESTs 



ESTs 
ESTs 

GATA-binding protein 2 
ESTs 

megakaryocyte-associated tyrosine kinase 

ESTs; Weakly similar to ATP(GTP)-binding protein [H.sapiens] 

ESTs 



Homo sapiens mRNA; chromosome 1 specific transcript KIAA0493 



ESTs 

cyclin-dependent kinase 6 
ESTs 

tumor necrosis factor; alpha-induced protein 2 

ESTs; Highly similar to WASP interacting protein [H.sapiens] 

small inducible cytokine subfamily B (Cys-X-Cys); member 11 



ribosomal protein S6 kinase; 90kD; polypeptide 5 

hexo kinase 3 (white cell) 

ESTs 

ESTs; Moderately similar to zinc finger protein RIN ZF [R.norvegicus] 

Accession not listed in Genbank 

Immunoglobulin Heavy Chain, Vdjc Regions (Gb:L23563) 



0.733 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.096 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.097 
0.098 
0.098 
0.098 
0.098 



0.098 
0.098 
0.098 
0.098 
0.098 

0^098 
0.098 
0.098 
0.098 
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123956 


AA621747 


Hs.1 12847 




0.099 




HG4018-HT4288 




OpiokJ-Binding Cell Adhesion Molecule 


0.099 




W47419 


Hs.56007 


Human DNA from chromosome 19-speciliccosmid F25965; genomic sequence 


0.099 




R 93 060 


Hs.35035 




0.099 


112748 


R93299 


Hs.166492 


ESTs 


0.099 








ESTs ant ' a6n 


0.099 


194R7n 


R69233 


Hs.101504 




0.099 






Hs.124940 




0.099 




AA401 995 


Hs.97860 




0.099 




AA046103 


Hs.102367 




0.099 






Hs.144524 


ESTs; Weakly similar to signal transducer and activator of 












0.099 




HG2149-HT2219 




Mucin (Gb:M57417) 


0.099 


105652 


AA2 82505 


Hs.19015 


ESTs 


0.099 


133390 


AA459945 


Hs.72660 


KIAA0585 protein 


0.099 








Fc fragment of IgE; high affinity I; receptor for; gamma polypeptide 


0.099 
0.099 




AA232667 


Hs.58210 








Hs.80776 




0.099 
0.099 


113464 


T86931 


Hs!l6295 


ESTs 


104240 


AB002368 


Hs.70500 


KIAA0370 protein 


0.099 




AA399109 


Hs 161813 




0.1 








ESTs; Weakly similar to dal2; len:343; CAI: 0.17fALC YEAST P25335 










ALLANTOICASE [S.cerevisiae] 




102405 


U43148 


Hs.159526 


patched (Drosophila) homolog 


o'i 


103599 


Z33905 


Hs.81218 


receptor-associated protein of the synapse; 43kD 


0.1 


121079 


AA398719 


Hs.14169 


ESTs; Weakly similar to CREB-binding protein [H.sapiens] 


0.1 


115820 


AA427487 


Hs.39619 


ESTs; Weakly similar to RETICULOCALBIN 1 PRECURSOR [H.sapiens] 


0.781 


125106 


T95766 


Hs.189760 


ESTs 


0.1 


131373 


N68116 


Hs.26146 


Down syndrome critical region gene 3 


0.1 


120224 


Z41239 


Hs.106960 


ESTs 


0.1 


133090 


M448228 


Hs.6468 


ESTs 


0.1 


132300 


M1 33244 


Hs.44234 


ESTs 


0.1 
0.1 


113129 
110638 


T49384 
H73197 


Hs.8988 
Hs.17241 


EST 
ESTs 


0.1 


131364 


R53255 


Hs.26010 


ESTs 


0.1 


105370 


AA236476 


HS22791 


ESTs; Weakly similar to transmembrane protein with EGF-like and two 










follistatin-like domains 1 [H.sapiens] 


0.238 
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TABLE 1 1 A shows the accession numbers for those primekeys lacking unigenelD's for 
Table 11. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
5 Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey CAT number Accession 



100721 19818J 
100748 41861J 
100750 15759J 



AW161357 AI879062 AI928938 AW161097 AW161167 BE314465 AA351715 F0709S. M179034 F08510 F00653 AI936671 
M476718 AW772464 AI807703 R44253 M976667 AI985186 AI650254 H38942 R84829 AA018724 M001000 HB5934 
AA019126 H85609 AA017000 AA33935S AW950556 D51397 AA213981 BE548002 AI056359 AA0O156O AW9521 13 
AA317769 AI857477 AI857475 AW249771 AW162661 H38943 AA018628 R85885 AI9B4613 AI934765 AI796172 AW157488 
A1929191 R85523 D51221 D53851 H85610 AI749674 F21582 AA323145 AA019127 AA687444 T06745 AI699293 H29532 
AA214029 AA223S56 NM_016834 X14474 R19697 H09695 R174S5 R13312 R19056 AI6B1231 AI59020O R37671 AA861828 
AI990023 AI935669 AW005821 M324581 H17335 R37659 R42802 R46242 R60936 R59731 H28993 AA479907 R44570 
AI890696 M308884 AA507078 R41274 AI365S07 T16348 A1560453 F03259 F04722 T16312 AA016081 AW073061 
BE314824 W28930 R44098 R51045 



AI702367 BE408966 BE280458 BE313759 BE513492 BE535404 BE280258 AC005263 NM.007165 L21990 AW732711 
AI564920 AW249094 BE265365 AW607186 AW607348 BE005217 H27211 U46230 BE260066 BE207043 BE546782 
AW248659 
AA085228AA085161 

L40904 NM_005037 X90563 AB005526 H21598 AA088517 



BE157260 BE157265 R481 18 H43827 Z17877 AW379070 AW291778 M20605 J03253 M14206 V00568 AI860465 AW296022 
M13930 AL047400 J00120 BE018476 AW675223 T2S980 F06694 R22709 R24720 H22753 AI9O31O0 AI9O3094 AW937823 
X00364 D10493 K01904 K01906 K00535 L00058 AA410662 AW384760 AA304930 AI680985 XO0198 H58025 AW998901 
AV653447 N31654 AW610357 AW610369 AW862480 BE223010 AW384172 AW384219 AW384171 AW384218 AA29S522 
BE140421AW945162AVV751711AA514409AVV747912AI214214W87741M972406AA554513BE302087AI249030 
AA477850 AV653129 AI281360 AI274110 W87861 AA541366 X66258 AIO516O0 AA877139 AA527483 AA857219 AI250782 
AA625531 AA807892 AI27881 1 AI224033 H24033 AA593396 AW129709 R45453 N22772 AA235530 T29737 AI016409 
AI688907 AA568370 AA722760 AI539329 AA550843 AW574698 AI5384S2 AI538453 AI3379S7 AA477744 AA464600 
A1140319 AW949294 AI339781 AI828736 M923634 AA344094 AI278350 AA975567 AA908416 AA857170 AW023520 
R43413 R48004 F02958 AI989439 R11207 AA737307 D10493 AW950652 AI093842 AI474024 AA703369 R1 1264 M13930 
M13930 M13930 M13930 M13930 J00120 M13930 M13930 X00364 J00120 R19507 AA639812 
N32759 N29730 N30831 N32604 N31955 AI206390 H87574 R23494 AI186215 N30036 AI741S12 J00117 NM.000737 
AI453626 AA330974 AI188729 AI188604 AI188964 N30276 AI188947 AI18883D AI188303 AI200457 AI219166 AI192459 
AI183280 AI18927S AI188639 AI186353 AI189616 AI184224 AI130720 AI188454 AI188391 AI148857 AI192447 AI209155 
AI190013 AI206355 AI188721 AI189429 AI189364 AI18B330 AI431595 AI189595 AI188781 AI148847 AI200022 AI221552 
AI220923 A1188728 AA233034 AI189807 AI189641 AI219044 AI148774 AI200658 W71989 AI207360 AI188824 AI200559 
AI200270 AA644163 AI1 99943 AI151301 AI189555 AI262724 AI148590 AI14869S AI126906 AI149163 K03183 K03189 
AI189842 AI221014 N30608 AI186465 AI2208S5 AI188498 AI138226 AI189968 AI221019 AI138197 AI149426 AI148904 
AI186218 AI188348 AI160579 AI198460 AI149039 AI160936 AI2190S5 AI184784 AI221580 AI161082 AI160814 AI12389S 
AI417614AI126101 AI188872 AI149571 A1168533 AI149072 AI149467 AI131286 N306B4 AI160705 AI160B92AI149559 
AI273580 AI189442 AI138448 AI149591 N27302 AA400910 AI138431 AI138435 AI128407 N30216 AI128296 AI21 9589 
AI1B8492 AI149447 AI168482 H95374 AI219009 N31616 AI276216 N32233 AI291937 N30741 A1188689 N27111 R23214 
AI221605 AI184348 AI200375 H94451 N26397 AI8718B1 AA232905 N30833 AI220780 H94446 N30822 H87464 R68815 
N30290 AI128424 H125B7 T47334 H87631 H87156 AI219133 AI868741 AA330859 H86993 AA330413 H93656 N30817 
T90191 H93668 AI200054 H95207 T47316 H95381 T49170 R00880T49171 N27381 H94107 R63352T85053AW451899 
H95142 N30313 H94015 H86987 T28278 N29701 C18834 AA331267 AA330939 AI654493 N27073 N29831 R681 13 N30758 
R26086 N32108 H95135 AA330414 AA330978 AI219422 AI189453 AM99951 X00264NM_000894AA371909 AA0B3496 
T29543 AA371971 AA372026 AA371978 AA371346 AI051683 AI186418 AI220659 AI189068 AI219266 AI186552 AI188715 
AI149156 

AW794626 M27126 M27014 

J05581 M61170 T27692 M34088 M34089 AW860335 AW579047 AW610437 AW610386 AW610422 AW610473 AW579078 
AW604897 AW860163 AW579087 AW862410 AI816584 AW177757 AW602769 AI909790 AW860331 AI909787 AI90981 1 
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100818 19604.3 
100881 458J27 



126126 1630017_1 

102620 1682L37 

102673 24986_6 

102675 5145_4 



127071 188097J 
126456 291965J 
119388 1762256J 



AI909813 AW845083 AI905920 AW387919 BE140766 AI909279 AW369405 AA429321 M429320 AA367451 AA847972 
AW001 137 AI567905 T84561 A1631295 AA1 51351 H02932 AI884519 AA367457 AW369421 AI678846 AW391803 AI610869 
AW192838 AI922289 AI952140 A1910233 AI479474 AW001 395 AA488073 AI985760 AW130017 AI853369 AA627845 
AW081 805 AA158865 AI624443 AA344985 AA589793 R72486 AI589329 AI903204 AI269893 AA641 284 AI279932 AA149270 
AI69712Q AA729146 AI589353 AM80067 AI923310 AA530908 AI275395 AM25062 AA580280 AA889527 AA158866 
AW131341 AA573028 AA877326 T29335 AW951288 H04235 AA099243 AA994659 AI659618 AA887919 AI299297 
AW001 116 AW263844 AI270578 AA970828 AW572126 AA775299 AW369449 AW369398 AW369452 AI933677 AI870710 
AI09291 1 AI582464 AI497674 AA937026 AA885865 L38597 M908325 AW369432 AW026623 AAB27778 AI264942 
AA932409 AI187328 AI672970 AI886098 AW440471 AW138860 AI866858 AI802528 AI926172 AW243914 AI933690 
AA996114 AA536189 AW009937 AI918060 AI270379 AI973169 AW175638 AW369413 

NM_006227 L26232 R50649 AU077024 AL008726 AA411079 R35151 BE278153 BE278139 AI459777 R88036 Z43210 
F07326 AF052157 R17844 BE615476 T82160 R71985 H21963 AA299158 AW368246 R48123 R50628 R70441 H27245 
H72015 R72345 R39392 AI909738 BE612778 BE613234 D52116 D52136 D52132 D52067 D51922 D51995 D51905 N34249 
N25459 AA464436 AA297350 AA297466 R81736 H02737 AW582505 R27523 AI834241 AW130867 W72668 W76426 
AA358363 R50262 AW473860 H52335 H43953 H21964 T39505 AI887517 AW156925 AW839850 H02628 AW007705 
AI561008 F22392 R71279 AA995433 R50725 W24462 R71931 AA464437 AW591731 R25667 R52695 R50810 AI560805 
AI089266 H68386 H41353 H28590 AW001860 AI141623 AA250773 AI284778 AWS1U12 AW083975 AA130377 AW026047 
R50551 R81494 AI357668 AI078272 F32666 F36981 AW304865 H43906 AA931068 R48010 AI540217 AI017339 AI291812 
AI741954 AA458490 AI088378 AA298764 H61168 AA358362 AA298725 AA298515 AA464148 AA443538 R43046 AA084314 
T40641 T47608 T48940 AI082477 AW470145 N92284 AI758958 AA298512 AA284586 AI597777 AA480277 AI932559 
AI869081 AA47661 5 AA503651 AI656024 AW168522 AI682051 AI689106 AI274592 AI520917 BE258916 BE615861 
BE280282 R53386 BE278255 BE278398 T47607 AA477662 H68385 

100817 19648J L34355 L46810 NM_000023 U08895 AA424260 AI097272 AM241 62 N79764 F19290 F25278 AI479385 
AA460662 AA432059 AW016935 F25770 F32549 F36677 F33016 F35992 F36010 AW1 72497 AA835076 F28727 AA211643 
AA453282 

U79251 AA843851 R38201 R66461 R44908 AA683289 H17477 R37364 R52832 AW298336 AA351391 NNL0Q2545 L34774 
AA296886 AW967001 T28889 R13451 T77331 AL119196 AL1 18830 H08459 AW892812 AW905838 H17585 R52878 
BE561958 BE561728 BE397612 BE514391 BE269037 BE514207 BE562381 BE514256 BE514403 BE514250 BE397832 
BE269598 BE559865 BE396881 BE560031 BE514199 BE560037 BE560454 
X07881 NM_006249 X07637 AA376715 AA376677 X07715 X07704 S80916 

BE387614 R51501 AA199714 AW674779 F08178 BE269071 AA376313 H08264 AA380420 H18785 AL042151 BE277758 



BE388647 BE537373 BE547158 AW409585 AW374033 AW602185 AA355725 AW577548 AW935015 AW935160 W40232 
AW938647 AW374332 AA434040 BE293488 AL138361 BE560260 AI745075 AA317980 AW949382 A1834311 AI653582 
A1831042 AI361878 AA618606 AA729052 AI424969 AA199715 AW769374 AI828422 AW044307 AI862816 AI203583 
AW084461 AW514655 AA831883 AA290672 AA831286 AA578510 AW089965 AW150746 AA292743 H22232 AI469275 
AW439312 AA292744 AW471443 AI473989 AA593336 AM64070 AI678937 AW069451 AA970763 AA610480 AA593328 
AA464009 AA768985 AI298928 M436600 AA464718 AA699361 D61482 D55935 AI369591 AA470695 AI809135 AA640627 
AI568446 R51502 W45467 AI655316 AA463934 AW168609 AW518653 BE045525 Z41251 AI868091 AA908160 AI026697 
AIB86259 AI612932 AA215437 AI956014 BE541087 BE255652 BE265878 BE394102 W27502 
U48936 L36592 X87160 NM_001039 AL036606 AL036420 U35630 AW298574 
W80551 M85370 
AA976427 U66052 
AI457548 U72509 
U72512T98357 R31335F18090 

L32961 NM_000663 U80226 S75578 AA425061 AA429317 AI815143 AA91 0669 AI286022 AI286019 
U88896 U88898 AA916056 T03285 AI341 594 AI359534 AI634031 U88897 

BE397750 AA232171 BE562900 BE384894 BE242228 BE206819 BE261742 AA296468 AW959763 BE276164 BE264109 
BE392626 BE25B735 AA301453 N55872 H01676 AA292746 AA427485 AA496400 AA352389 

Y10518 Y10514 Z83935 Y10508 AK000055 Y10519 AI142012 AI681175 BE222219 AA890586 BE504347 BE328064 N63044 
N51226 AI151248 AI521996 AI924777 AW375954 AI860275 W00549 AI742673 AW612288 AI763062 AA632510 AI087347 
AI088070 AI214349 AA890297 AI494156 AI698598 AA631658 AA504593 AA860733 AI266761 AW663214 AW771231 
AA639610 AI769806 AI769746 AW014326 AI28861 1 



AA429212W00881 
T88798 R92430 
AI084125 AI083773 AI479687 AI939609 AI968662 AF 
AA827671 AA811055A/ 



AA321355 AW964592 R23284 H738B3 R23382 N47914 C01377 H04668 AW606248 R34447 AA84713S AI684489 AI523112 
AW044269 AI379138 N29366 AA761543 N79248 AA960845 AA768316 AI147926 AI718599 AI880620 R67467 AI216016 
AI738663H04648 

NM_001395 Y08302 A1434619 AI470328 AI261807 AW024965 AI806537 AI830549 A1640337 AI219065 AW271700 
AW028488 AI133339 AI859205 R51175 U87167 BE379324 BE392008 AA340819 AA3431 10 T57275 D59164 AW299312 
AI434422 AI936390 AW024975 R40262 

AW269126 R09430 T56590 AI367247 AI253132 BE464248 T58658 AW207785 T58607 
R51 194 AI732276 R53587 AIB20697 

AK000526 BE550O84 W30689 AW271859 AA411456 AI341551 AA242990 AA243027 H87046 D20360 AI184053 M146956 
AI721023 AI718944 AA146955 F18215 AA903890 AI700355 AI075430 AM11584 AA878210 AI476760 AW945637 AA630596 
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AA431522 AA301989 AI909058 D12149 N41960 BE222214 AA609922 AA828176 M393359 AA398693 AW024956 
BE467805 AW298623 AW264085 AI024454 AI024719 AI431927 T55087 AI61 1014 T54920 AA131253 AI436344 

114427 9724_2 AA017176 AI359979AA047836 AA017O63AAQ163O3AA0O1545 

114569 110077J AA063315 M063316 

100106 15621_-5 AF015910 

100515 342.1 AA305746 D90187T63943AW951154 T29182A1734941 D1 3264 AI299239Z1 8812 AW299859 W24476 AA933064 

AA489759 

100531 46038J AW888554 AW607282 AA319986 M28590 

100545 22955J1 M55405 AW752552 

100574 17320J AA326895 M10036 NM_000365 N84665 H69414NB4657 M380453AA329743 AA357367AA1 88770 AA376532 AA353653 

AA158953 AA083176 BE537313 AA181433 D53373 R57376 AA206698 R14807 H18899 H11191 H93892 R25593T61134 
N93285 AA083081 M831789 H13137 AA497014 AA079330 AA182861 H13138 W47161 R62913 AA687089 AA2111 12 
AA429237 AL035923 AA100070 AW392898 AI566433 AA866006 AA214002 AW392865 N79454 AA197181 AI680371 
AA176501 AA737967 AI089225 F34874 AW571437 AI620620 AA573489 AA423816 AA164917 AA458455 T47072 AI569087 
AI261656 M730919 AI633441 AW195182 AI351622 AW243465 AI872649 AI359227 AA987941 AI693770 T47073 AW779948 
AW510580 AI635626 AW627601 AA864326 AA953578 AI341418 BE222853 AI241 963 AI094663 AA928380 AA493373 
AW043762 AI377783 AW958987 BE619760 AA385240 BE277975 BE280095 AW631443 AA5B1048 BE618715 BE299610 
C14874 BE559858 BE378455 BE61 8290 BE544585 AI525575 BE548897 BE2671 10 AA804738 BE269821 AA918133 
BE277647 AA599947 BE280735 BE390239 N74150 T12504 AI208197 AW955527 AA1 13897 N40081 H73335 H70393 
AI434041 W22950 Al 1 92661 BE264461 W25486 AA626424 AA1 96694 T69209 AA857976 AI540287 AA41 0599 AA864287 
AW950564 AA013320 T49283 AI541438 AW804703 AA335534 AA335659 BE562269 BE618802 BE277850 BE546413 
BE280994 AA204813 BE561694 BE543524 BE253647 AW001452 W191 16 BE542508 AA205B94 BE254875 BE270033 
A1525906 BE251792 AA975700 BE272138 AW607671 N87686 M10036 BE515060 BE298607 AI745178 U47924 H03193 

100627 1igr_HT2798 Z25424 

100756 tigr_HT3768 M88357 

100768 fgr_HT3846 L29141 M69180 M81105 

100813 IIgr_HT4265 L33999 

100836 tigr_HT4383 U04688 

100855 tigr_HT4504 U09806 

102104 entrez_U12139 U12139 

125091 genbank_T91518 T91518 

100929 1igr_HT688 X65561 

125147 _entrez_W38150 W38150 

102354 entrez_U38268 U38268 

102491 entrez_U51010 U51010 

102636 entrez_U67092 U67092 

118769 genbank_N74496 N74496 

101046 entrez_K01160 K01160 

101057 entrez_K03430 K03430 

108334 genbank_AA070473 M070473 

108417 483241J AAO70853 AA075749 AA075716 

108441 genbank_AA079079 AA079079 

108786 genbank_AA128999 AA128999 

101655 entraz M50299 M60299 

101697 entrez_MS4358 M64358 

117437 genbank_N27645 N27645 

101798 entrez_MB5220 M85220 

101909 entrez_S69265 S69265 

103508 entrez_Y10141 Y10141 

103575 entrez_Z26256 Z26256 

119332 genbank_T54095 T54095 

112161 genbank_R48295 R48295 

119564 NOT_FOUND_entrez_W38206 W38206 

114376 NOT_FOUND_entraz_GMCSF GMCSF 

100478 tigr_HT1067 M22406 

100547 tigr_HT2219 M57417 

100564 tigr_HT2324 Z11585 
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TABLE 12: shows genes, including expression sequence tags, that are down-regulated in 
prostate tumor tissue compared to normal prostate tissue as analyzed using Affymetrix/Eos 
HuOl GeneChip array. Shown are the ratios of "average" normal prostate to "average" 
5 prostate cancer tissues. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Background subtracted normal prostate : prostate tumortissue 



UnigenelD Unigene Title 



Prolactin-lnduced Protein 



Hs.13743 
Hs.21948 
Hs.7413 



AA361094 
AA149204 
AA174173 
AA251741 



130376 
122484 
104142 



Hs.221128 
Hs.1 75783 
Hs.12622 



118068 t 
114251 2 
112134 f 
101436 f* 
104028 
103944 
103838 
120469 
110279 
127472 



102457 U48807 Hs.2359 

114011 W90385 Hs.16082 

101249 L33881 Hs.1904 

123265 AA491209 Hs.1 05265 



101673 M61908 

115586 AA399218 

120590 AA2817B0 

109748 F10192 

134727 X60507 

129171 AA234048 



104490 N71503 
102124 U14528 
109280 AA196635 
109707 F09739 
103087 AA045709 
135006 M21665 
119182 R80664 
129806 R62444 
101435 M20543 



Hs.6241 
Hs.92423 
Hs.111441 



Hs.8939 
Hs.7753 
Hs.1 11460 



ESTs 
ESTs 

Human alkali myosin light chain 3 mRNA; complete cds 
ESTs 

ESTs; Highly similar to growth arrest inducible gene product [H.sapiens] 
ESTs 

DKFZP586M1824 protein 

ESTs 

ESTs 

pallid (mouse) homolog; pallidin 
dual specificity phosphatase 4 
ESTs 

protein kinase C; iota 

ESTs; Weakly similar to reverse transcriptase [M.musculus] 

ESTs; Modly smlr to I! ALU SUBFAMILY SQ WARNING ENTRY !! [H.sapiens] 

phosphoinositide-3-kinase; regulatory subunit; polypeptide 1 (p85 alpha) 

ESTs 

ESTs; Weakly similar to similar to Kruppel-like zinc finger protein [C.elegans] 
Tubulin; alpha; brain-specific 
yes-associated protein 65 kDa 



7.212 
7.175 
6.701 
6.642 



5.5 
5.486 



Hs.1 65701 

Hs.40545 

Hs.929 

Hs.77067 

Hs.1 73373 

Hs.1288 

Hs.221184 



AA448286 
AA447006 
3 N32787 
3 284483 
3 Z40I86 
3 AA262491 
3 AA609749 
D H97993 



Hs.26409 
Hs.1 86572 
Hs.1 12759 
Hs.172788 



ESTs; Highly similar to multifunctional calcium/calmodulin-dependent protein 

kinase II deltas isoform [H.sapiens] a.4 

ESTs; Modly smlr to I! ALU SUBFAMILY SX WARNING ENTRY I! [H.sapiens] 5579 

ESTs; Weakly similar to dysferlin [H.sapiens] 5566 

salute carrier family 26 (sulfate transporter); member2 5.151 

ESTs 5.134 

Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 21 920 5.075 

ESTs 5.075 

myosin; heavy polypeptide 7; cardiac muscle; beta 5.055 

ESTs - 5.033 

KIAA0931 protein 4.675 

actin; alpha 1 ; skeletal muscle 4.626 
yt72d 2.r1 Soares retina N2b4HR Homo sapiens cDNA clone IMAGE275735 5', 4.6 

ESTs 4.559 

prolactin-induced protein 4.451 

ESTs 4.45 
ESTs; Weakly similar to !! ALU SUBFAMILY J WARNING ENTRY I! [Rsapiens] 4.45 

KIAA0432 gene product 4.301 

ESTs; Highly similar to atrophin-1 interacting protein 4 [Rsapiens] 4.2 

ESTs; Moderately similar to !! ALU SUBFAMILY SQ WARNING 4.175 

ESTs; Moderately similar to hypothetical protein 2 [H.sapiens] 4.1 
Human DNA sequence from PAC 46H23, BRCA2 gene region chromosome 13q12-134.05 

ESTs 4.05 

ESTs 4.048 

ESTs; Highly similar to unknown protein [R.noivegicus] 4.041 

ESTs; Weakly similar to KIAA0512 protein [H.sapiens] 4.026 
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120217 Z41078 


Hs. 66035 ESTs 




108536 AA084524 






134460 AA400030 


Hs 8360 ESTs* Weakly similar to !' ALU CLASS B WARNING ENTRY !! [H.sapiGns] 


3925 


120418 AA236010 


Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (from clone DKFZp586F1323) 




132783 N74897 


Hs.5683 DEADyH (Asp*Glu-Ala-Asp/His) box polypeptide 15 




125052 T80174 


Hs.222779 ESTs; Moderately similar to similar to NEDD-4 [H.sapiens] 




108600 AA099585 


Hs 41175 ESTs 


3.833 


103099 X61100 






134948 H06773 


Hs.93850 protein kinase; AMP-activated; gamma 2 non-catalytic submit 




120511 AA258144 


Hs. 221 576 ESTs 


3779 


111861 R37460 


Hs.25231 ESTs 




113966 W86600 


Hs.9842 ESTs 




131649 AA481254 


Hs.30120 ESTs 




129775 R94659 


Hs.12420 ESTs 




110191 H20568 


Hs.27182 phospholipase A2-activating protein 




112678 R87160 




37 


127115 AA375791 


Hs.131894 ESTs 




132892 W92797 


Hs. 59378 DKF2P434G1 62 protein 




115023 AA252079 


Hs.63931 dachshund (Drosophila) homolog 




114932 AA242751 


Hs.16218 KIAA0903 protein 


106865 AA487228 


Hs.19479 ESTs 




134480 AA024664 


Hs.83916 NADH dehydrogenase (ubiquinone) 1 alpha subcomplex; 5 (13kD;B13) 




124780 R42493 


Hs220839 ESTs 




130631 AA025399 


Hs.1 69737 ESTs 




134154 AA211320 


Hs.79404 neuron-specific protein 


3 568 


104160 AA455706 


Hs.997^2 to is, weaKiy similar to /oimj uluouoc mcuulmi tu rnui ci« 






PRECURSOR 




105524 AA258158 


Hs 22153 ESTs' Weakly similar to KIAA0352 [H sapiens] 


3.542 


110168 H19673 


Hs.176586 ESTs 




109480 AA233299 


Hs.72158 ESTs 




109585 F02367 


Hs.27252 ESTs 


35 


115134 AA257107 


Hs.1 94331 ESTs 




116083 AA455653 


Hs.44581 ESTs; Weakly similar to HEAT SHOCK 70 KD PROTEIN 6 [H.sapiens] 




120524 AA261852 


Hs.1 92905 ESTs 




116932 H74330 


Hs.150000 ESTs 


3 425 


130746 AA256976 


Hs.18800 ESTs; Weakly similarto KIAA0579 protein [H.sapiens] 


3.42 


107513 X05451 


Hs.158295 Human alkali myosin light chain 3 mRNA; complete cds 




118641 N70298 


Hs.49829 ESTs 


3407 


126584 AI028384 


Hs.127331 ESTs 




105134 M159953 


Hs22895 ESTs; Weakly similar to arylsulfatase B precursor [H.sapiens] 


3!325 
3.318 


123502 M600116 


Hs.1 12526 ESTS 


132389 N50866 


Hs.47135 ESTs 


3.317 


105691 AA287097 


Hs.75356 transcription factor 4 


3.315 


131505 H85897 


Hs27755 ESTs 


3.309 


120775 AA342104 


Hs.96777 EST 


3.3 


105579 AA278824 


Hs.19218 ESTs 


3295 


128190 AA946876 


Hs.148376 ESTs 


3292 


100819 HG4020-HT4290 Transglutaminase 


328B 
3273 


130217 D29956 


Hs.152818 ubiquitin specific protease 8 


130068 AA608903 


Hs.106220 KIAA0336 gene product 


3269 


134719 L07515 


Hs.89232 chromobox homolog 5 (Drosophila HP1 alpha) 


3266 


110277 H29209 


Hs.151231 ESTs; Highly similar to FYVE finger-containing phosphoinositide kinase [M.musculus] 3.2£ 


127354 AA4188B0 


Hs.1 85797 ESTs 


3212 


129173 R60523 


Hs.109087 ESTs 


3.197 


127464 AA970504 


Hs.146103 ESTS 


3.179 


124923 R94500 


Hs.108046 ESTs 


3.175 


122465 AA443164 


Hs.991 53 ESTs; Highly similar to CGI-73 protein [H.sapiens] 


3.151 


122027 AA431302 


Hs.98721 EST; Weakly similar to N-copine [H.sapiens] 


3.151 


103329 X85134 


Hs.72984 retinoblastoma-binding protein 5 


3.15 


129937 M95767 


Hs.135578 chitobiase; di-N-acetyl- 


3.15 


134197 AA057341 


Hs.87889 helicase-moi 


3.15 


107764 AA018219 


Hs.226923 ESTs 


3.125 


121775 AM21773 


Hs.161008 ESTs 


3.125 


114768 AA149007 


Hs.1 82339 Ets homologous factor 


3.12 


132381 N48818 


Hs.46884 ESTs 


3.11 


123105 AA485973 


Hs.143947 ESTs 


3.104 


121176 AA400080 


Hs.97774 ESTs 


3.1 


125053 T80620 


Hs.186473 ESTs 


3.075 


105909 AA401739 


Hs.5111 ESTs 


3.066 
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W72562 


Hs.58119 ESTs 


3.057 




AA424038 


Hs.58197 ESTs 


3.056 




R22988 


Hs.220950 ESTs 


3.05 




AA280047 


Hs.43948 ESTs 


3.05 




N6B412 


Hs.49189 ESTs 


3 




AA456598 


Hs.256269 ESTs 


2.995 


110415 


H48239 


Hs.29739 ESTs; Weakly similar to RAS-RELATED PROTEIN RAB-3A [H.sapians] 


2.979 




AA182030 


Hs.8364 ESTs 


2.978 


101102 


L07594 


Hs.79059 transforming growth factor; beta receptor III (betaglycan; 300kD) 


2.976 




H58383 


Hs.258544 ESTs 


2.976 




R24464 


Hs.202949 KIAA1 102 protein 


2.964 




HG4058-HT4328 Oncogene AmU-EvM , Fusion Activated 


2.957 


106822 


AA481063 


Hs.31835 ESTs 


2.95 




D11930 


Hs.3592 ESTs 


2.95 






Hs.15119 ESTs 


2.936 




T93795 


Hs.17252 EST 


2.917 




AA2 10695 


Hs.17212 ESTs 


2.917 




AA490227 


Hs.105252 ESTs 


2.904 


125250 


W87465 


Hs.222926 ESTs; Weakly similarto D2092.2 [C.elegans] 


2.9 




AA465160 


Hs.63386 ESTs 


2.9 




AA432243 


Hs.41086 ESTs; Weakly similarto OXYSTEROL-BINDING PROTEIN [H.sapiens] 


2.896 


111712 


R22905 


Hs.1 13716 ESTs 


2.895 




W78107 


Hs.1 87698 ESTs; Weakly similar to Yer140wp [S.cerevisiae] 


2.895 




N64378 


Hs.13149 ESTs; Highly similar to unknown function [H.sapiens] 


2.894 




AA280300 


Hs.1 91 346 ESTs 


2.886 




AA152263 


Hs.1 8827 KIAA0849 protein 


2.883 


129486 


H03686 


HS.2206B9 Ras-GTPase-aotivating protein SH3-domaln-binding protein 


2.879 




W73788 


Hs.43213 ESTs 


2.875 




R59881 


Hs.7503 ESTs 


2.871 




AA028171 


Hs.153688 ESTs 


2.868 


128420 


AI088155 


Hs.14146 ESTs; Weakly similarto unknown [H.sapiens] 


2.866 




AA280738 


Hs.1 28679 ESTs 


2.863 




AA482019 


Hs.238202 EST 


2.86 


129462 


D84239 


Hs.1 1 1732 IgG Fo binding protein 


2.856 


125166 


W45491 


Hs.172609 nucleobindin 1 


2.854 




W01626 


za36e07.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 






AA227972 


Hs.43635 ESTs 


2.85 


105077 


AA142919 


Hs.5558 ESTs 


2.847 




R34531 


Hs.92200 KIAA0480 gene product 




121080 


AA398720 


Hs.1 77953 ESTs 


2.838 




R73816 


Hs.1 7385 ESTs 


2.836 




R26206 


Hs.153293 KIAA0701 protein 


2.825 




AA427783 


Hs.77910 3-hydroxy-3-methylgIutaryl-Coenzyme A synthase 1 (soluble) 


2.816 




AA504356 


Hs.88650 ESTs 


2.813 




AA280627 




2.806 




U48736 
H20332 


Hs.198891 serine/threonine-protein kinase PRP4 homolog 

Hs.28707 signal sequence receptor; gamma (translocon-associated protein gamma) 


2.8 


115054 


AA252863 


Hs.87729 ESTs 


2.795 




AA063546 




2.792 


124308 


H93575 


Hs.227146 Homo sapiens mRNA; cONA DKFZp564J142 (from clone DKFZp564J142) 


2.783 




AA732329 


Hs.1 91 959 ESTs 




114800 


AA159825 


Hs.1 31 887 ESTs; Weakly similar to ORF YNL2270 [S.cerevisiae] 


2768 


128625 


AA242816 


Hs.1 02652 ESTs; Weakly similarto KIAAQ437 [H.sapiens] 


2.766 




H51098 


Hs.1 51 31 0 PDZ domain protein (Drosophila inaD-like) 


2.75 




AA620504 


Hs.22119 ESTs 


2.742 




T90746 


Hs.1 5233 ESTs 


2.734 


104639 


AA004622 


Hs.18214 ESTs 


2.727 


127609 


AA622559 


Hs.1 50318 ESTs 


2.726 


106922 


AA4909S4 


Hs.10056 ESTs 


2.725 


124825 


R52088 


yg85c3.s1 Soares infant brain 1NIB Homo sapiens cDNA clone 


2.725 


124333 


H98683 


Hs.1 54054 ESTs 


2.708 


117634 


N36421 


Hs.1 07854 ESTs; Weakly similar to SODIUM- AND CHLORIDE-DEPENDENTGLYCINE 








TRANSP 


2.706 


101609 


M54927 


Hs.1787 proteolipid protein 1 (Pelizaeus-Merzbacher disease; spastic paraplegia 2; 


2.704 






uncomplicated) 


117142 


H96908 


Hs.42251 ESTs 


2.7 


112602 


R79147 


Hs.203365 ESTs 


2.695 


106828 


AA481505 


Hs.1 3797 ESTs 


2.68 


124377 


N25996 


HS.179B33 ESTs 


2.675 
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101026 J04970 


carboxypeptkiase M 




124660 N66393 


Hs.1 02754 ESTs 




124066 H02494 


Hs.101615 ESTs 




1302B1 R12777. 


Hs.1 5395 ESTs; Weakly similar to ARGINYL-TRNA SYNTHETASE [H.sapiens] 




110949 N49602 


Hs.13308 ESTs 


2 65 


111031 N54839 


Hs.221085 ESTs; Highly similar to mediator (H.sapiens] 




121770 AA421714 


Hs.1 1469 KIAA0896 protein 


9M 


134132 U32519 


Hs.220689 Ras-GTPase-activating protein SH3-domain-binding protein 




112424 R62452 


Hs.1 91265 ESTs 




122544 AA451679 


Hs.194410 ESTs 




134425 X90568 


Hs.1 72004 titin 


2B24 


111114 N63391 






116119 AA459242 


Hs.44445 ESTs; Weakly similar to Kelch motif containing protein [H.sapiens] 




112079 R44164 


Hs.23014 ESTs 




123033 AA481271 


Hs.1 93945 ESTs 




124196 H52617 


Hs.144167 ESTs 


2536 


125873 H 14437 


yl25a04.rt Soares breast 3NbHBst Homo sapiens cDNA clone 


2.53 


117684 N40184 


Hs.45050 ESTs 


2.575 


134938 D30037 


Hs.168326 phosphotidylinositol transfer protein; beta 


2.575 


131822 AA215647 


Hs.200332 ESTs 


2.568 


135185 U71203 


Hs.96038 Ric (Drosophila)-like; expressed in many tissues 


2.554 


117690 N40467 


Hs.93834 ESTs 


2.557 


118807 N78582 


Hs.50732 protein kinase; AMP-activated; beta 2 non-catalytic subunit 


2.552 


121369 AA405657 


Hs.128791 Human DNA sequence from clone 967N21 on chromosome 20p12.3-13. Contains 


2.55 


114860 AA235112 


Hs.106227 ESTs; Moderately similar to similar to murine RNA-binding protein [H.sapiens] 


2.549 


121857 M426017 


Hs.62694 ESTs; Highly similar to DNA-REPAIR PROTEIN COMPLEMENTING 


2.548 


110190 H20560 


Hs.244624 ESTs 


2.548 


132573 AA045333 


Hs.51743 ESTs; Weakly similarto !! ALU SUBFAMILY SB2 WARNING ENTRY !! [H.sapiens] 2.542 


109706 F09729 


Hs.12780 ESTs 




135109 AA410391 


Hs.94592 klotho 


2.525 


132810 R37027 


Hs.5737 KIAA0475 gene product 


2.525 


124879 R73588 


Hs.101533 ESTs 


2.525 


103840 AA174190 


Hs.50932 ESTs 


2.525 


119056 R22196 


Hs.34492 ESTs 


2.519 


114833 AA234362 
112998 T23555 


Hs.87310 ESTs; Moderately similar to CGI-66 protein [H.sapiens] 
Hs.103288 ESTs 


2.507 
2.5 


123312 AA496258 


Hs.99S01 ESTs 


2.499 


121873 AA426270 


Hs.145696 splicing factor (CC1.3) 


2.491 


123321 AA496884 


Hs.23972 ESTs 


2.491 


107760 AA018042 


Hs.95078 EST 


2.433 


102580 U60808 




2.481 


103053 X56741 


Hs.5947 met transforming oncogene (derived from cell line NK14)- RAB8 homolog 


2.475 


124756 R38100 


Hs.106294 ESTs 


2.475 


112936 T15665 


Hs.6185 ESTs;WeaklysimilartoBcDNA.GH12174[D.melanogasterl 


2.475 


125178 W58202 


Hs.125731 ESTs 


2.475 


112423 R62447 


Hs.22123 ESTs 


2.471 


123515 AA600323 


Hs.1 12535 EST 


2.462 


102842 U95020 


Hs.21903 calcium channel; voltage-dependent; beta 4 subunit 


2.457 


102400 U42390 


Hs.171957 triple functional domain (PTPRF interacting} 


2.455 


113187 T56056 


Hs.9992 ESTs 


2.452 


131687 L11066 


Hs.3069 heat shock 70kD protein 9B (mortalin-2) 


2.448 


115314 AA280583 


Hs.256501 ESTs 


2.437 


128211 AI206427 


Hs.166707 ESTs; Highly similar to Ran-binding protein 2 [H.sapiens] 


2.43 


134281 L11005 


Hs.81047 aldehyde oxidase 1 


2.425 


115985 AA447709 


Hs.132094 ESTs; Moderately similar to putative transcription factor CA150 [H.sapiens] 


2.425 


111348 N90041 


Hs.9585 ESTs 


2.418 


129430 AA258B42 


Hs.197877 Homo sapiens clone 23777 putative transmembrane GTPase mRNA; partial cds 


2.418 


133883 C13990 


Hs.76930 synucleln; alpha (non A4 component of amyloid precursor) 


2.417 


111164 N66857 


Hs.14808 ESTs; Weakly similarto !! ALU CLASS C WARNING ENTRY !! [H.sapiens] 


2.416 


132143 AA257056 


Hs.7972 KIAA0871 protein 


2.412 


130330 M55047 


Hs.154679 synaptotagmin 1 


2.408 


114219 Z39451 


Hs.27389 ESTs 


2.406 


117101 H94043 


Hs.24341 DKFZP58611419 protein 


2.403 


125433 M034325 


Hs.54320 ESTs 


2.4 


111099 N62506 


Hs.21958 ESTs 


2.4 


120323 AA195405 


Hs.1 10347 Homo sapiens mRNA for alpha integrin binding protein 80; partial 


2.397 


118624 N69996 


Hs.21801 ESTs 


2.394 


123570 AA608955 


Hs.1 09653 ESTs 


2.389 


123562 AA608893 


Hs.190065 ESTs 


2.388 
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131546 AA262821 


Hs 28578 muscle blind (Drosophila)-like 




103143 X66141 


Hs 188691 ESTs" ' ^ ' ^ ^ 1 


2 384 


1 23645 AA60931 0 






130123 AA001835 








Hs.30654 ESTs 










125168 W45574 






123973 C 14805 


Hs.182151 ESTs 




135197 U76456 












107734 AA016225 


H 93385 EST 


2354 




Hs.4138 1 cols; weakly similar to UDiquiun hydrolyzing enzyme I [H.sapiensj 




111163 N66B50 






112349 R58877 


Hs 22665 ESTs* Moderately similar to dJ83L6 1 [H sapiens] 










134238 R81509 






116766 H13260 


Hs.95097 ESTs 




106331 AA436853 


Hs 34795 ESTs 




129003 AA443752 


Hs. 10784 ESTs 


2 332 


132368 AA599814 


Hs.46637 ESTs; Weakly similar to cDNA EST yk289g5 5 comes from this gene [C.elegans] 




124697 R06273 


Hs.1 86467 ESTs; Modiy smlrto !! ALU SUBFAMILY J WARNING ENTRY !! {H, sapiens] 




120273 AA1 76688 


Hs 221 139 ESTs 




127110 AA304993 


ns. luuoD i to i s, weawy similar 10 pou Kaianin [n.sapransj 




105450 AA252621 


Hs.93842 ESTs 




119819 W74371 


Hs.58383 ESTs 




102302 U33052 


Hs.69171 protein kinase C-Iike 2 




130596 N74353 


Hs 16475 ESTs 




114161 Z38904 


Hs.22385 ESTs; Weakly similar to KIAA0970 protein [H.sapiens] 






Human sperm membrane protein BS-63 mRNA, complete cds 




104491 N71513 


Hs 39328 ESTs 




116988 H82527 


ys69e12.s1 Soares retina N2b4HR Homo sapiens cDNA clone 








2 273 


108800 AA129731 


HSS0424 ESTs' 


2.273 


101310 L41607 


Hs.934 glucosamine (N-acetyl) transferase 2; l-branching enzyme 


2.269 


126842 W 19498 


Hs.21085 ESTs 


2.255 




Hs.128638 ESTs 


2.251 


124647 N91947 


Hs.125033 ESTs 


2.249 


127112 AI143906 


Hs.125103 ESTs 


2547 


101973 S82597 


Hs.80120 UDP-N-acetyl-alpha-D-galaclosamine:polypeptide 


2.243 


120999 AA398302 


Hs.127437 ESTs 


2.245 


130225 AA599583 


Hs.15299 HMBA-indudble 


2.243 


119980 W88678 


Hs549247 heterogeneous nuclear protein similar to rat helix destabilizing protein 


2.243 


124222 H61053 


Hs.222844 ESTs 


254 


129199 H 9091 4 


Hs.128629 ESTs 


2536 


106802 AA479101 


Hs.16570 ESTs; Weakly similar to !! ALU SUBFAMILY SQ WARNING ENTRY !! [H.sapiens] 2531 




Hs.247277 ESTs; Weakly similar to transformation-related protein [fisapiens] 


2.229 


104627 AA001976 


Hs.19603 ESTs 


2.223 




Hs.42484 Homo sapiens mRNA; cDNA DKFZp564C053 (from clone DKf=Zp564C053) 


2.226 




Hs.8345 ESTs 


2.225 


135336 AA452822 


Hs.99027 ESTs 


2.225 


135344 R62976 


Hs.168491 ESTs; Moderately similar to TRF1-interacting ankyrin-related 


2.225 




Hs.118443 ESTs; Moderately similar to AKT3 protein kinase [H.sapiens] 


2.222 


128885 AA397841 


Hs.180141 cofilin 2 (muscle) 


2513 


107900 AA026385 


Hs.176600 ESTs; Moderately similar to I! ALU SUBFAMILY SB2 WARNING 


2.217 


1 1 446 1 AA033562 


Hs.151572 ESTs 


2512 


109292 AA1 99828 


Hs.188662 ESTs 


2.212 


104257 AF006265 


Hs.9222 estrogen receptor-binding fragment-associated gene 9 


2.209 


132932 T15482 


Hs.6093 ESTs 


2504 


127392 AA262728 


Hs.14896 Homo sapiens clone 24590 mRNA sequence 


2504 


104641 AA004652 


Hs.18564 ESTs 


25 


122529 AA449828 


Hs.99229 ESTs 


2.195 


124307 H93562 


Hs.162395 proline synthetase co-transcribed (bacterial homolog) 


2.193 


133601 S95936 


Hs.75155 transferrin 


2.193 


119904 W85709 


Hs.128927 ESTs; Weakly similar to !! ALU SUBFAMILY SP WARNING ENTRY I! [H.sapiens] 2.192 


100348 064109 


Hs.4994 transducer of ERBB2; 2 (TOB2) 


2.185 


126871 AA351779 


Hs500334 ESTs 


2.18 


127793 AI298835 


Hs.30445 ESTs; Weakly similar to transcription regulator Staf-50 (H.sapiens] 


2.178 


105149 AA169253 


Hs.8958 ESTs 


2.177 


121367 AA405648 


zw39g8.s1 SoaresJotalJetus_Nb2HF8_9wH sapiens cDNA clone IMAGE:772478 2.177 
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111836 R36228 






133394 R 16759 


Hs.237225 nbosomal protein So pseudogene 1 




123207 AA489697 


Hs.145053 ESTs 




129801 F11087 


Hs.239666 ESTs 




103393 X94612 


Hs.41749 protein kinase; cGMP-dapendent; type II 




132415 AA043223 




2157 


106369 AA443828 


Hs.25324 ESTs 




122963 AA478446 


Ms 69559 KIAA1096 protein 


2156 


133473 M 19309 


Hs 73980 troponin T1 * skeletal* slow 




134257 C06270 






135156 AA056012 


Hs.9552 binder of An Two 




104055 AA393755 






102313 U33921 


HSU33921 Clontech adult lung cDNA library (HL1158a) Homo sapiens cDNA 




109788 F10638 




2 15 


103507 Y10032 






116000 AA448710 


Hs.41327 ESTs 




105858 AA399164 


Hs.227676 ESTs; Moderately similar to !! ALU SUBFAMILY SQ 




103153 X66534 


Hs.75295 Quanylate cyclase 1 ; soluble; alpha 3 




126202 AA652238 




2 135 


115955 AA446121 






104164 AA458770 






108692 AA121270 


Hs.82960 ESTs 




122878 AA465341 


Hs.99640 ESTs 




134771 L13939 


Hs.89576 adaptor-related protein complex 1 ; beta 1 subunit 




104298 D31120 


Hs.40368 adaptor-related protein complex 1 ; sigma 2 subunit 




104840 AA039595 






122180 AA435798 






131012 H01992 


Hs.202949 KIAA1 102 protein 




134092 H17490 


Hs.7905 ESTs; Highly similar to sorting nexin 9 [Ksapiens] 




118617 N69666 


Hs.183413 ESTs; Modtly smlr to !! ALU SUBFAMILY J WARNING ENTRY !! [H.sapians] 




107155 AA621202 


Hs.7946 DKFZP586D1519 protein 


212 


130925 N71935 


Hs.1 69378 multiple PDZ domain protein 




135167 U63717 


Hs.95B21 osteoclast stimulating factor 1 




105952 AA405263 


Hs.1 81400 ESTs 




110308 H38148 


Hs.32775 ESTs 


2108 


116368 AA521186 


Hs.94217 ESTs 




132939 U76189 




2102 


117881 N50073 


Hs.84926 ESTs; Highly similar to B-IND1 protein [M.musculus] 




121723 AA419622 


Hs.104800 ESTs; Weakly similar to Mouse 195 mRNA; complete cds [M.musculus] 




103500 Y09443 


Hs-22580 alkylglycerone phosphate synthase 




121429 AA406293 


Hs.1 93498 ESTs 




134632 AA398710 


Hs.1 741 39 chloride channel 3 




129785 F10980 


Hs.184780 ESTs 


209 


111065 N58193 






114710 AA129931 


Hs.79081 protein phosphatase 1 ; catalytic subunit; gamma isotorm 




132711 N73702 


Hs.238927 ESTs 




133377 R05490 


Hs.7239 SEC24 (S. cerevisiae) related gene family; member B 




124773 R40923 


Hs.1 06604 ESTs 


2 078 


117759 N47587 


Hs.97345 ESTs; Weakly similar to TROPOMODUL1N [H.sapiens] 




127386 AI457411 


Hs,1 06728 ESTs 


2076 


101167 L15309 


Hs.1 93677 zinc finger protein 141 (clone pHZ-44) 




109597 F02582 


Hs.14474 ESTs 




124390 N29325 


Hs.7535 ESTs; Highly similar to COBW-iike placental protein [Hisapians] 




116225 AA478609 


Hs.47278 Human Chromosome 16 BAC clone CIT987SK-A-735G6 




131243 R16667 


Hs.24752 spectrin SH3 domain binding protein 1 




130557 T90830 


Hs.15981 ESTs; Weakly similar to line-1 protein ORF2 [H.sapiens] 




134103 D14826 


Hs.155924 cAMP responsive element modulator 


2.064 


108833 AA131866 


Hs!61661 ESTs; Weakly similarto DY3.6 [G.elegans] 


2.063 


112286 R53765 


Hs.158135 K1AA0981 protein 


2.063 


125624 AA165411 


zq49a01 .M Stratagene hNT neuron (#937233) Homo sapiens cDNA clone 


2.061 


124612 N72200 


Hs.13913 ESTs 


2.058 


116335 AA495830 


Hs.87013 ESTs 


2.057 


112248 R51361 


Hs.23423 ESTs 


2.056 


115789 M424754 


Hs.43149 ESTs 


2.056 


107029 AA599219 


Hs.187492 ESTs; Weakly similarto ALR (H.sapiens] 


2.056 


110294 H30270 


Hs.165062 ESTs 


2.054 


120532 AA262354 


Hs.186648 ESTs 


2.054 


118180 N59249 


Hs.48349 ESTs 


2.052 


132018 AA293194 


Hs.3737 ESTs 


2.052 
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132617 AA171913 


Hs.5338 carbonic anhydrase XII 




131526 N36167 


Hs.28274 ESTs 


2.05 


113254 T64438 


Hs.1 1449 DKFZP5640123 protein 




122785 AA459978 


Hs.99508 ESTs 




107203 D20426 


Hs.5656 EST 


2.05 


105713 AA291321 


Hs.184319 ESTs; Moderately similar to KIM1006 protein [H.sapiens] 




129385 D82675 


Hs.1 10950 Homo sapiens clone 25007 mRNA sequence 




119116 R43845 


Hs.64595 DKFZP566E2346 protein 




116405 AA600253 


Hs.55601 ESTs; Highly similar to host cell factor 2 [H.sapiens] 




125924 AA526849 


Hs.82109 syndecan 1 




105599 AA279442 


Hs.143460 protein kinase C; nil 




119741 W70205 


Hs.43670 kinesin family member 3A 




101449 M21494 


Hs.1 18843 creatine kinase; muscle 




107109 AA609943 


Hs.32793 ESTs 




117040 H89112 


yw25e5.s1 Morton Fetal Cochlea Homo sapiens cDNA clone IMAGE:25328 




132906 AA142857 


Hs.234896 ESTs; Highly similar to geminin [H.sapiens] 




105479 AA255546 


Hs.23467 ESTs 




102031 U04898 


Hs.2156 RAR-related orphan receptor A 




119846 W80363 


Hs.58446 ESTs 




124809 R46482 


Hs.1 06875 ESTs 




130286 M041548 


Hs.154023 K1AA0573 protein 




124457 N50114 


Hs.128704 ESTs 




125144 W37999 


Hs.24336 ESTs 




120581 AA281257 


Hs.125868 ESTs 




104931 AA062731 


Hs.108319 thyroid hormone receptor-associated protein; 150 kDa subunit 




120548 AA278B46 


Hs.187634 ESTs 




113933 W81362 


Hs.30567 ESTs 




123072 AA485041 


Hs.1 04308 ESTs 




123648 AA609323 


Hs.1 12689 ESTs 


2.003 


116875 H67749 


Hs.161022 EST 




103179 X69398 


Hs.82685 CD47 antigen (Rh-related antigen; integrin-associated signal transducer) 




103478 Y07755 


Hs.38991 S100 calcium-binding protein A2 


1.995 


111007 N53378 


Hs.22543 ESTs 


1.995 


120470 AA251797 


zs11f3.s1 NCI CGAP GCB1 Homo sapiens cDNA clone 


1.989 


112280 R53457 


Hs.26040 ESTs; Weakly similar to fatty acid omega-hydroxylase [H.sapiens] 


1.989 


114127 Z38652 


Hs.1 06961 ESTs; Weakly similar to TYL [H.sapiens] 


1.988 


129863 AA151005 


Hs.129872 sperm surface protein 


1.988 


106320 AA436608 


ESTs 


1.988 


108933 AA1 47224 


Hs.71814 ESTs 


1.986 


105906 AA401633 


Hs.22380 ESTs 


1.982 


109029 AA1 57911 


Hs.72200 ESTs 


1.982 


118470 N66769 


Hs.82781 ESTs 


1.975 


11535B AA281886 


Hs.88923 ESTs 


1.975 


115257 AA279060 


Hs.193516 B-cellCLL/lymphoma10 


1.974 


126879 AA719776 


Zh38g04.s1 Soares_pineal_gland_N3HPG Homo sapiens cDNA clone IMAGE:4143901.9" 


109547 F01479 


Hs.26966 ESTs 


1.973 


127111 AA805726 


Hs.220509 ESTs 


1.969 


101266 L36645 


Hs.73964 EphM 


1.966 


129319 AA037467 


Hs.30340 ESTs 


1.955 


106211 AA428240 


Hs.126083 ESTs 


1.962 


112753 R93696 


Hs.169882 ESTs 


1.961 


120489 AA255538 


Hs.190504 ESTs 


1.959 


129699 AA458578 


Hs.12017 KIAA0439 protein; homolog of yeast ubiquitin-protein ligase Rsp5 


1.956 


105425 AA251129 


Hs.24416 ESTs 


1.953 


134740 L37362 


Hs.89455 opioid receptor; kappa 1 


1.95 


109324 AA210700 


Hs.86405 Homo sapiens mRNA; cDNA DKFZp564P056 (from clone DKFZp564P056) 


1.95 


124303 H93043 


Hs.1 07070 ESTs 


1.95 


102337 U36922 


Human fork head domain protein (FKHR) mRNA, 3' end 


1.948 


109441 AA228100 


Hs.86998 nuclear factor of activated T-calls 5 


1.946 


127364 Ml 79573 


Hs.90061 progesterone binding protein 


1.942 


105255 AA227498 


Hs.3623 ESTs 


1.942 


130672 L19783 


Hs.177 phosphatidylinositol glycan; class H 


1.942 


104301 D45332 


Hs.6783 ESTs 


1.94 


132442 R62589 


Hs.167419 ESTs 


1.939 


105519 AA258063 


Hs.23438 ESTs 


1.937 


132902 AA490969 


Hs.168147 ESTs 


1.936 


118873 N89B81 


Hs.44577 ESTs 


1.936 


114124 Z38595 


Hs.125019 ESTs; Highly similarto KIAA0836 protein [H.sapiens] 


1.934 


115075 AA255486 


Hs.88045 ESTs 


1.933 
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110695 H93483 


Hs.124777 ESTs 


1.931 


105360 AA236209 
124998 T56013 


Hs.1 87626 ESTs 

Hs.77910 3-hydroxy-3-methylg[utaryl-Cosnzyme A synthase 1 (soluble) 


1.931 
1.929 


121816 AA424814 


Hs.1 87509 ESTs 


1.927 


111717 R23241 


Hs.1 10776 STAT induced STAT inhibitor-2 


1.925 


128874 H06245 


Hs. 106801 ESTs 


1.925 


109391 AA219699 


Hs.1 84245 KIAA0929 protein Msx2 interacting nuclear target (MINT) homolog 


1.913 


126129 H82165 


Hs.40334 ESTs 


1.911 


115553 AA369027 


Hs.71414 ESTs 


1.905 


113811 W44928 


Hs.4878 ESTs 


1.905 


108345 AA07Q906 


zm65d1 .si Stratagene neuroepithelium (#937231 ) Homo sapiens cDNA clone 


1.904 


120472 AA251875 


Hs.104472 ESTs; Weakly similar to Gag-Pol polyprotein [M.musculus] 


1.903 


116602 D80063 


Hs.241673 EST 


1.901 


121121 AA399371 


Hs.189095 ESTs; Weakly similar to zinc linger protein SALL1 [H.sapiens] 


1.9 


125330 AA401804 


Hs.1 14574 ESTs 


1.896 


130095 F01831 


Hs.1 4838 ESTs 


1.894 


119782 W72982 


HS.58262 ESTs 


1.894 


104115 AA428090 


Hs.26102 ESTs 


1.893 


131313 C17938 


Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from clone DKFZp564O0122) 


1.891 


105563 AA278907 


Hs.24549 ESTs 


1.891 


122825 AA461195 


Hs.99580 ESTS 


1.887 


119495 W35390 


Hs.55533 ESTs 


1.886 


130309 AA134289 


Hs.15423 Homo sapiens BAC clone RG114B19from7q31.1 


1.886 


125628 AA418069 


Hs.241493 natural killer-tumor recognition sequence 


1.886 


110611 H66947 


Hs!l 4671 ESTs; Highly similar to gene ERCC5 protein [H.sapiens] 


1.885 


117301 N22569 


Hs.43215 ESTs 


1.884 


131406 N92239 


Hs.26471 Wnt inhibitory factor-1 


1.881 


126428 AAQ13312 


Hs.64988 ESTs 


1.881 


120285 Ml 82882 


Hs.1 1 1 1 10 titin-cap (telethonin) 


1.878 


112724 R91753 


Hs.1 7757 ESTs 


1.878 


103121 X63679 


Hs.4147 translocating chain-associating membrane protein 


1.875 


124381 N26765 


Hs.1 09008 ESTs 


1.875 


117226 N20468 


Hs.1 77322 ESTs; Weakly similar to putative p150 [Rsapiens] 




105610 AA279991 


Hs.124691 ESTs; Weakly similar to trithorax homotogue 2 [H.sapiens] 


1.875 


111229 N69113 


Hs.1 10855 ESTs 


1.875 


120627 AA285079 


Hs.1 90474 ESTs 


1.873 


107048 AA600012 


Hs.10669 ESTs; Moderately similar to KIAA04Q0 [H.sapiens] 


1.872 


104041 AA381902 


Hs.197114 RNA binding protein 


1.872 


115162 AA258368 


Hs.227806 ras GTPase activating protein-like 


1.872 


102239 U26726 


Hs.1376 hydroxysteroid (1 1 -beta) dehydrogenase 2 


1.87 


100043 M10098 


AFFX control: 18S ribosomal RNA 


1.863 


120296 M191353 


Hs.22385 ESTs; Weakly similar to KIAA0970 protein [H.sapiens] 


1.867 


129011 S72869 


Hs.1 07932 DNA segment; single copy; probe pH4 (transforming sequence; thyroid-1 ; 


1.867 


134851 R44479 


Hs.90232 KIAA0552 gene product 


1.865 


117392 N26175 


Hs.93405 ESTs 


1.864 


114530 M053027 


Hs.1 91797 ESTs 


1.863 


123541 AA608794 


Hs.1 12592 ESTs 


1.863 


124890 R78618 


Hs.34145 ESTs; Weakly similar to RAS-RELATED PROTEIN RAB-8 [H.sapiens] 


1.862 


105299 AA233511 


Hs.1 94720 ATP-binding cassette; sub-family G (WHITE); member 2 


1.861 


103560 Z20556 


Hs.182787 myosin; heavy polypept 6; cardiac muscle; alpha (cardiomyopathy; hypertrophic 1) 1.861 


113073 T33S37 


Hs.6841 ESTs 


1.86 


120407 AA235040 


Hs.107283 ESTs 


1.859 


103892 AA243523 


Hs.1 7155 ESTs 


1.858 


123795 AA620381 


Hs.70488 ESTs 


1.857 


108524 M084323 


Hs.68138 ESTs 


1.857 


113953 W85812 


Hs.1 87554 ESTs 


1.856 


110721 H97678 


Hs.31319 ESTs 


1.858 


129426 AA412087 


Hs.1 68272 EST; Highly smlr to prot inhibitor of activated STAT prot PIASx-alpha [H.sapiensl 


1.853 


112102 R44840 


Hs.21303 ESTs 


1.852 


118502 N67317 


Hs.50150 ESTs 


1.852 


107619 M004955 


Hs.60015 ESTs 


1.851 


100436 D87446 


Hs.75912 KIAA0257 protein 
Hs.1 91648 ESTs 


1.85 


120652 AA287312 


1.85 


121643 M417078 


Hs.1 93767 ESTs 


1.843 


117387 N26011 


Hs.53810 ESTs 


1.843 


132084 Y12394 


Hs.3886 karyopherin alpha 3 (importin alpha 4) 


1.843 


124449 N48593 


Hs.121820 ESTs 


1.841 


120263 M173440 


Hs.193919 ESTs 


1.838 


127226 M731036 


Hs.3463 ribosomal protein S23 


1.833 
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7 R36447 
7 M64174 
9 M018937 



126579 V 
102341 I 
114246 Z 
131757 C 
108904 A 



122537 
114359 
111289 
110826 
104106 
130043 



124477 
100782 
106101 
115479 
116104 
114173 



125904 
133883 
105777 



Hs.50651 Janus kinase 1 (a protein tyrosine kinase) 
Hs.128629 ESTs 

Human nebulin mRNA, partial cds 
Hs.146082 ESTs 
Hs.81 10 adducin 3 (gamma) 
Hs.12079 ESTs 

Hs.316 OEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 6 (RNA halicase; 54kD) 
Hs.71148 ESTs; Weakly similar to putative p150 [H.sapiens] 
Hs.42484 Homo sapiens mRNA; cDNA DKFZp564C053 (from clone DKFZp564C053) 
7 AA6U9UU8 Hs.183232 ESTs 
1 D12485 Hs.11951 phosphodiesterase l/nucleotide pyrophosphatase 
1 (homologous to mouse Ly-41 antigen) 

3 H30539 Hs.189838 ESTs 

4 N59959 Hs.48443 ESTs 

7 M016021 Hs.173091 DKFZP434K151 protein 
7 D78156 Hs.241548 RAS p21 protein activator 2 

5 M489016 Hs.91216 ESTs; Highly similar to partial CDS; human putative tumor suppressor [H.sapiens] 
3 R54112 Hs.128697 ESTs 

Hs.6968 ESTs 

Hs.153483 ESTs; Moderately similar to H1 chloride channel [Rsapiens] 

6 ESTs 

ESTs 
ESTs 

ESTs; Weakly similar to I! ALU SUBFAMILY J WARNING ENTRY !! [H.sapiens] 
HS-diaw ESTs 
Hs.122684 ESTs 
Hs.102682 ESTs 
101 0 Basic Transcription Factor 2, 34 Kda Subunit 

Hs.34395 ESTs 

zs52h09.s1 NCI_CGAP_GCB1 H sapiens cDNA clone IMAGE:701153 
Hs.78524 ESTs 
Hs.21963 ESTs 

Hs.5398 guanine-monophosphate synthetase 
death effector domain-containing 
ESTs; Weakly similar to F55A12.9 [C.elegans] 
ESTs 
ESTs 

nudeDporin 153kD 
ESTs 
ESTs 

choline kinase 



AA422123 
AA055404 
AA432Q80 
AA05614O 
N53158 



2 N59764 
5 R49548 

3 N91087 
2 AA177138 
5 N25427 
1 Z25535 

AA406367 
H22372 
AA397915 
AA348412 
H19480 



115278 
133104 
131170 
100136 



N48674 
D13540 
AA331157 



5 N69S82 
7 AA600057 
0 R40096 
S T89386 
3 AA262710 
5 AA489020 
3 AA441792 



Hs.28728 
Hs.161671 
Hs.108812 
Hs-211608 
Hs.15973 
Hs.163588 
Hs.77221 
Hs.23096 
Hs.174309 
Hs.7584 
Hs.28243 
Hs.67466 
Hs.65029 
Hs.23796 
Hs.22868 

Hs.24979 
Hs.247043 
Hs.42179 
Hs.28464 
Hs.51957 
Hs.70266 



Hs.38176 
Hs.108614 
Hs.193424 
Hs.22857 



105104 
107601 
131467 



AA436475 
AA151771 
AA004636 
W68255 
N66413 



KIAA0905 pr> 
ESTs 

KIAA0606 protein; SCN Circadian Oscillatory Protein (SCOP) 

KIAA0627 protein 

ESTs 

chord domain-containing protein 1 

Guanine Nucleotide-Binding Protein G25k 

EST86676 HSC172 cells I Homo sapiens cDNA 5' end, mRNA sequence 

ESTs 

ATPase; Na+/K+ transporting; beta 3 polypeptide 



1.817 
1.817 
1.815 
1,813 
1.812 
1.811 



ESTs 

ESTs; Weakly similar to hypothetical protein; similar to [H.sapiens] 
ESTs 

ESTs; Weakly similar to BACN32G1 1 .d [D.melanogaster] 
growth arrest-specific 1 

Human DNA sequence from clone 1052M9 on chromosome Xq25. Contains the 

protein tyrosine phosphatase; non-receptor type 11 

EST35035 Embryo, 6 week, subtracted (total cDNA) I Homo sapiens cDNA 

ESTs 

KIAA0525 protein 
ESTs 
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107969 AA034030 


Hs.155212 methylmalonyl Coenzyme A mutase 




115527 AA342079 


Hs.252055 ESTs 




132471 T16305 


Hs.49349 beta-site APP-cleaving enzyme 


1.775 


105966 AA4Q6105 


Hs.5344 adaptor-related protein complex 1 ; gamma 1 subunit 




127548 AA373091 


Hs.93832 Homo sapiens clone 24483 unknown mRNA; parital cds 


1.774 


106217 AA428379 


Hs.24870 ESTs 




131214 N26777 


Hs.1 72635 ESTs 


1.773 


106295 AA435654 


Hs.8583 similar to APOBEC1 




106328 AA436705 


Hs.28020 KIAA0766 gene product 




124661 N93797 


Hs.3090 EphB1 




122988 AA479166 


Hs.105633 ESTs 




115504 AA291946 


Hs.42736 ESTs 




105168 AA180208 


Hs.16606 ESTs; Highly similar to CGI-32 protein [H.sapiens] 




129153 AA188618 


Hs.181461 ariadne; Drosophila; homolog of 


1.766 


105829 AA398290 


Hs.21965 ESTs 


1.764 


101811 M86917 


Hs.24734 oxysterol binding protein 




100138 D13628 


Hs.2463 angiopoietin 1 




124704 R07335 


ye96rt.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 




122314 AA442257 


Hs.1 92076 ESTs 




109865 H02566 


Hs.191268 Homo sapiens mRNA; cDNA DKFZp434N174 (from clone DKFZp434N174) 




106206 AA428069 


Hs.895 1 9 KIAA1 046 protein 




107135 AA620782 


Hs.23247 ESTs 




105760 AA338960 


Hs.28170 ESTs 




106288 AA435536 


Hs.24336 ESTs 




103968 AA304566 


Hs.3542 ESTs 




129559 AA234945 
117885 N50112 


Hs.1 1360 ESTs 
Hs.47023 ESTs 


1.754 


107032 AA599472 


Hs.247309 succinate-CoA ligase; GDP-forming; beta subunit 




124807 R45963 


Hs.23381 1 ESTs; Weakly similar to ORF2 [M.musculus] 




100276 D42047 


Hs.82432 KIAA0089 protein 




110924 N47938 


yy84a09.s1 Soares_multiple_sclerosis_2NbHMSP Homo sapiens cDNA clone 


1.751 


133002 AF006082 


Hs.62461 ARP2 (actin-related protein 2; yeast) homolog 




132530 AA455917 


Hs.50785 SEC22; vesicle trafficking protein (S. cerevisiae)-like 1 




110759 N21671 


Hs.1 9025 ESTs 


1.75 


106138 AA424515 


Hs.33264 ESTs 




107348 U43701 


Hs.184776 ribosomal protein L23a 


1.75 


115867 AA432162 


Hs.1 65986 DKF2P586B2022 protein 




135398 AA1 94075 


Hs.99903 nuclear receptor coactivator 4 




113783 W19222 


Hs.7041 ESTs; Weakly similarto !! ALU SUBFAMILY SQ WARNING ENTRY !! [H.sapiens] 


1.747 


134898 X98330 


Hs.90821 ryanodine receptor 2 (cardiac) 




132215 T10132 


Hs.4236 K1AA0476 gene product 




104229 ABO02346 


Hs.61289 synaptojanin 2 


1.743 


116166 AA461556 


Hs.202949 KIAA1102 protein 




115433 AA284252 


Hs.58372 ESTs 




114908 AA236545 


Hs.54973 ESTs 




127425 AA470941 


Hs.143162 ESTs 


1.741 


131089 Z38807 


Hs.22870 ESTs 




113498 T8B908 


Hs.1 89746 ESTs 


1.738 


116710 F10577 


Hs.70312 ESTs 




127210 R51476 


yg76f04.r1 Soares infant brain 1NIB Homo sapiens cDNA clone 


1.733 


120554 AA279654 


Hs.1 94524 ESTs 




129940 U18242 


Hs.1 3572 calcium modulating ligand 


1.732 


117023 H88157 


Hs.41105 ESTs 


1.731 


111700 R22212 


Hs.23361 ESTs 




116911 H72240 


Hs.39292 ESTs; Moderately similar to KIAA0745 protein [H.sapiens] 


1.731 


106025 AA412063 


Hs.6065 ESTs 








1726 


111614 R12581 


Hs.191146 ESTs 


1726 


134134 L76703 


Hs.173328 protein phosphatase 2; regulatory subunit B (B56); epsllon isoform 


1.725 


106B86 AA489086 


Hs.36545 ESTs 


1.725 


117998 N52136 


Hs.93828 ESTs 


1.725 


121204 AA400422 


Hs.55896 ESTs 


1.725 


121342 AM04995 


Hs.1 92480 ESTs 


1.725 


131129 R27296 


Hs.23240 ESTs 


1.725 


116235 AA479181 


Hs.1 86726 ESTs 


1.725 


102423 U44754 


Hs.179312 small nuclear RNA activating complex; polypeptide 1 ; 43kD 


1.724 


110273 H29050 


Hs.24096 ESTs 


1.722 


108758 AA127395 


Hs.222414 ESTs 


1.722 


110672 H88477 


Hs.191178 ESTs 


1.721 
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134663 W73367 

104902 AA055475 

120582 AA281290 

134891 F03517 

106219 AM28567 

116372 AA521311 

107570 AA001870 

106198 M427816 

125136 W31479 

104973 AA085676 

128710 J04813 



Hs.90787 
Hs.26613 
Hs.13854 
Hs.237323 
Hs.11803 
Hs.129051 
Hs.6763 
Hs.104117 
Hs.107127 
127871 AA7B6511 Hs.128848 
116089 AA455933 Hs.41324 
Hs.1 32797 
Hs.1 62686 
HS21610 
Hs.88148 
Hs.136102 
Hs.77039 
Hs.27179 
Hs.31092 
Ks.48610 
Hs.49476 
Hs.21917 
Hs.100636 
HS241569 
Hs.606 
Hs.184122 



123337 AA504153 

123619 AA609200 

104781 AA026617 

115114 M256468 

117852 N49408 

127644 T57570 

111359 N91273 

131721 L36644 

132438 F08925 

132476 N67192 

130990 F02488 

128499 AA487503 

120780 AA342337 

132920 L06133 

135037 U77948 

110024 H11297 

134415 AA329274 

102223 U24685 



Hs.7942 
Hs.82112 
Hs.30824 
Hs.8215 



118250 N62602 

106470 AA450116 
108203 AA057678 
119748 W70313 
116576 D51228 
123035 AA481392 
126668 AA011616 
101512 M28209 
102704 U76638 
126218 AA256386 
111180 N67277 
105937 AA404342 
114118 Z38520 
109203 M1 90634 
125245 W86608 
102906 X06956 
125914 AA262925 
134294 U63289 
109742 F10108 
134674 D63876 
104079 AA402937 
107554 M001386 
132439 AA243139 
124515 N58172 
124300 H92575 
126809 AA743475 



Hs.250716 
Hs.54089 
Hs.13649 
Hs.9403 



Hs.1 75930 

Hs.1 08787 

Hs.7243 

Hs.75318 

Hs.180034 

Hs.81248 

Hs.1 83333 



Hs.1 09370 
Hs.105959 
Hs.171693 



ESTs; Waakly similar to ZINC FINGER PROTEIN 136 [H.sapiens] 
interferon-induced; hepatitis C-assooiated microti* ular aggregate prot (44kD) 
sex comb on midleg (Drosophila)-like 1 
ESTs 

clathrin; light polypeptide (Lea) 

ESTs; Weakly similar to BC331 191J |H.sapiens] 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586F1 323 (trom clone DKFZp586F1323) 
ESTs 

N-acetylglucosamine-phosphate mutase; DKFZP434B187 protein 

ESTs 

ESTs 

KIAA0942 protein 

cytochrome P450; subfamily IIIA (niphedipine oxidase); polypeptide 5 
Homo sapiens mRNA; cDNA DKFZp564G022 (from clone DKFZp564G022) 
ESTs 
ESTs 

ESTs; Weakly similar to ORF YGL050w [S.cerevisiae] 
ESTs 

ESTs; Highly similarto BAH -associated protein 1 [H.sapiens] 
ESTs 

KIAA0853 protein 
ribosomal protein S3A 
ESTs 
EphA5 
ESTs 

Homo sapiens clone TUA8 Cri-du-chat region mRNA 

KIAA0768 protein 

ESTs 

ESTs; Modtly smlr to II ALU SUBFAMILY SQ WARNING ENTRY !! [H.sapiens] 
ATPase; Cu++ transporting; alpha polypeptide (Menkes syndrome) 
general transcription factor II; i 
ESTs 

protein tyrosine phosphatase type IVA; member 2 
Human ant'-B cell autoantibody IgM heavy chain variable V-D-J region (VH4) 
gene; clone El 1 ; VH4-63 non-productive rearrangement 
ESTs 

rnterlaukin 1 receptor; type I 
ESTs 

ESTs; Weakly similarto double-stranded RNA-binding nuclear 
protein DRSBP76 [Ksapiens] 

Homo sapiens mRNA full length insert cDN A clone EUROIMAGE 783648 

yz75b6.s1 Soares_multiple_sclerosis_2NbHMSP Homo sapiens cDNA done 

IMAGE288851 3' similarto contains Alu repetitive element;, mRNA sequence 

ESTs 

ESTs 

ESTs 

neuron-specific protein 

ESTs 

ESTs 

RAB1; member RAS oncogene family 
BRCA1 associated RING domain 1 

Novel human gene mapping to chomosome 13; similarto rat RhoGAP 

ESTs 

ESTs 

ESTs 

endoplasmic reticulum membrane protein 

ublqultJn specific protease 24 

tubulin; alpha 1 (testis specific) 

cleavage stimulation factor; 3' pre-RNA; subun'it 3; 77kD 

CUG triplet repeat; RNA-binding protein 1 

ESTs 

KIAA01 54 protein 

ESTs 

ESTs 

Homo sapiens clone 25088 mRNA sequence 
ESTs 

ESTs; Waakly similar to !! ALU SUBFAMILY SQ WARNING ENTRY I! [H.sapiens] 
ESTs 



1.719 
1.717 
1.717 
1.717 



1.708 
1.707 
1.705 
1.705 
1.704 
1.702 



1.695 

1.694 

1.694 
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106095 AM19547 
101754 M77142 
105188 AA192306 
113582 T91371 
119559 W38197 
119961 W87535 
123255 AA490890 
111078 N59230 
113082 T40528 
119589 W44692 
104308 D53639 
103073 X59417 
124424 N35314 
128890 AA096157 
119400 T92767 

131631 AA486868 
118229 N62339 
118533 N67954 
130666 AA476307 
103093 X60708 
128667 U69140 
112933 T15530 
114546 AA056263 
126705 AA579377 
114399 M007595 
118836 N79820 
100401 D85423 
105681 AA284865 
132526 AA460128 
133809 AA034002 
115968 AA447083 



109644 F04477 

103427 X97303 
132186 T33888 
131428 U17838 
126638 AA649257 
114503 AA039568 
121242 AA400857 
122414 AA446885 
110632 H72344 
111389 N95837 
112449 R63802 
113070 T33464 
107229 D59284 
132710 W93726 
124664 N94814 
130166 AA350690 
125040 T78451 
132972 H39627 
115873 AA433916 
120408 AA235045 
120934 AA383773 
115259 AA279071 



133389 AA166917 

115528 AA3423Q1 

129704 W81301 

109313 AA206800 

130457 U58091 

123076 AA485211 

115113 AA256460 

117731 N46433 



Hs.16824 EST 



Hs.59015 

Hs.105273 

Hs.186574 

Hs.8245 

Hs.124177 

Hs.77904 

Hs.74077 

Hs.107265 

Hs.182364 



ESTs 
ESTs 
ESTs 

ribosomal protein S26 

proteasome (prosome; macropain) subunlt; alpha type; 6 
ESTs 

ESTs; Weakly similar to 25 kDa trypsin inhibitor [H.sapiens] 
ye27d06.s1 Stratagene lung (#937210) Homo sapiens cDNA clone 
IMAGE:1 18955 3', mRNA sequence, 
slit (Orosophila) homolog 2 
heat shock 90kD protein 1; alpha 
ESTs 

KIAA0737 gene product 

dipeptidylpeptidase IV {CD26; adenosine deaminase completing protein 2) 
fasciculation and elongation protein zeta 2 (zygin II) 
ESTs 



Hs.29802 
Hs.180532 
Hs.49413 
Hs.194035 
Hs.44926 
Hs.103419 
Hs.221439 
Hs.132747 

Hs.1 80532 heat shock 90kD protein 1 ; alpha 



Hs.50854 ESTs 

Homo sapiens mRNA for Cdc5, partial cds 
Hs.171228 KIAA1 040 protein 
Hs.5074 similar to S. pombe dim1 + 
Hs.76359 catalase 
Hs.1 34522 ESTs 

Hs.236204 ESTs; Moderately similar to NUCLEAR PORE COMPLEX 

PROTEIN NUP107 [R.norvegicus] 
Hs.204802 ESTs; Moderately similar to GLYCERALDEHYDE 3-PHOSPHATE 

DEHYDROGENASE; LIVER [H^apiens] 
H.sapiens mRNA for Ptg-12 protein 
Hs.221040 KIAA1038 protein 
Hs.26719 PR domain containing 2; with 2NF domain 
Hs.1 88602 ESTs 
Hs.188083 ESTs 
Hs.97509 EST 

Hs.99087 ESTs; Moderately similar to ZINC FINGER PROTEIN 141 [H.sapiens] 
Hs.171635 ESTs 

Hs.169111 ESTs; Weakly similar to L82A [D.melanogaster] 
Hs.124186 ring finger protein 2 
Hs.6298 ESTs 
Hs.34644 ESTs 

Hs.55279 protease inhibitor 5 (maspin) 

Hs.33540 ESTs;WeaklysimilartoKIAA0765protein[H.sapiens] 

Hs.151411 KIAA0916 protein 

Hs.199961 ESTs 

Hs.164967 ESTs; Weakly similar to !! ALU SUBFAMILY SB WARNING ENTRY !! [H.sapiens] 
Hs.90093 heat shock 70kD protein 4 
Hs.190151 ESTs 
Hs.191500 ESTs 

Hs.13453 splicing factor 3b; subunit 1 ; 155kD 

Hs.8185 ESTs; Highly similar to CGI-44 protein [H.sapiens] 

Hs.49007 poly(A) polymerase 

Hs.1 09896 ESTs 

Hs.111650 ESTs; Weakly similar to Prt1 homolog [H.sapiens] 
Hs.72639 ESTs 

Hs.53929 ESTs; Weakly similar to II ALU CLASS B WARNING ENTRY II [H.sapiens] 
Hs.12064 ubiquitin specific protease 22 

Hs.86276 ESTs; Moderately similar to zinc finger protein dp [H.sapiens] 
Hs.155976 cullin 4B 
Hs.190046 ESTs 
Hs.44610 ESTs 
Hs.46609 ESTs 

223 



1.547 
1.S46 
1.646 
1.645 
1.544 
1.642 



1.625 
1.624 
1.624 



1.615 
1.615 
1.611 
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114918 AA236813 
114807 AA160805 
105103 AA1S1593 
125004 T60120 



134675 AA250745 

105431 AAS52033 

120187 Z40251 

115830 AA428137 

135069 AA456311 

122997 AA479295 

119707 W67569 

131934 D80948 

106141 AA424558 

115271 AA279422 

131468 R27598 

131165 R98173 

117273 N21680 

101569 M33772 

116127 AA459703 

120022 W90625 

117512 N32157 

106511 AA452865 

116415 AA609204 

127879 AA810215 

125211 W72798 

114746 AA1 35638 

122698 AA456112 

116765 H12636 

130895 AA609828 

114338 Z41365 

111005 N53076 

128135 AA913491 

112046 R43365 

132160 AA281770 

111568 R10153 

127775 H04106 

115359 AA281936 

121845 AA425734 

127854 AA769520 

120287 AA187679 
114940 AA243012 
126716 AA031700 
134161 U97188 
125390 H95094 
115334 AA281244 
113721 T97931 
114895 AA236177 
119341 T62571 
108012 AA039616 
130335 AA156499 
134351 R82074 
133300 D51401 
106920 AA490899 
118744 N74075 
126489 W20016 
115913 AA436720 
107868 AA025234 
134520 N21407 



Hs.171857 

Hs.3238 

Hs.151791 



Hs.87773 
Hs.15036 
Hs.56974 
Hs.86434 



adenovirus 5 E1 A binding protein 

KIAA0092 gene product 

ESTs; Highly similar to unknown [H.sapiens] 

ESTs 

ESTS 

yb68f02.s1 Stratagene ovary (#937217) Homo sapiens cDNA clone 

IMAGE:76347 3', mRNA sequence. 

ESTs 

yt85e8.s1 Soares_pinea|_gland_N3HPG Homo sapiens cDNA clone 

IMAGE:231 1 1 3' similar to contains Alu repetitive element;, mRNA sequence 

ESTs; Weakly similar to hypothetical protein [H.sapiens] 

zn55d01 ,r1 Stratagene muscle 937209 Homo sapiens cDNA clone 

IMAGE:562081 5', mRNA sequence. 

protein kinase; cAMP-dependent catalytic; beta 

ESTs; Weakly similar to !! ALU SUBFAMILY J WARNING ENTRY !! [H.sapiens] 

ESTs 

ESTs 

ESTs; Weakly similar to I! ALU CLASS A WARNING ENTRY !! [H.sapiens] 

Kelch motif containing protein 

ESTs; Weakly similar to SNF2alpha protein [H.sapiens] 

ESTs 



1.599 
1.597 
1.596 
1.596 
1.596 
1.594 



Hs.5724 
Hs.27197 
Hs.23763 
Hs.43047 
Hs.182421 
Hs.79070 



Max-interacting protein 
ESTs 

troponin C2; fast 

v-myo avian myelocytomatosis viral oncogene homolog 

ESTs 

ESTs 

UOP-Gal:betaGlcNAc beta 1;4- galactosyltransferase; polypeptide 2 

KIAA0874 protein 

ESTs 

ESTs; Wkly smlr to cDNA EST EMBLD32579 comes from this gene [C.elegans] 

ESTs 

ESTs 

ESTs; Weakly similar to reverse transcriptase [H .sapiens] 

ESTs; Highly similar to tetracycline transporter-like protein [M.muxulus] 

KIAA0872 protein 

ESTs 

ESTs; ModrfJy smlr to !! ALU SUBFAMILY J WARNING ENTRY I! [Ksapiens] 
ESTs 

seven in absentia (Drosophila) homolog 1 
ESTs 

ESTs; Weakly similar to NG22 [Ksapiens] 
ESTs 

ESTs; Weakly similar to hypothetical protein [H.sapiens] 

ESTs; Weakly similarto REGULATOR OF MITOTIC SPINDLE 

ASSEMBLY 1 [H.sapiens] 

ESTs 

ESTs 

ESTs 

IGF-II mRNA-binding protein 3 

translocase of outer mitochondrial membrane 20 (yeast) homolog 

ESTs 

EST 

KIAA0887 protein 
microtubule-associated protein 7 
ESTs 

protein kinase; cAMP-dependent; regulatory; type II; alpha 

syndecan 1 

ESTs 

ESTs 



Hs.144228 ESTs; Weakly similar to ZINC FINGER PROTEIN 83 [H.sapiens] 
Hs.65487 ESTs 
Hs.61260 ESTs 



Hs.99410 

Hs.121585 

Hs.21015 

Hs.40109 

Hs.5996 

Hs.189143 

Hs.22273 

Hs.184081 

Hs.20561 

Hs.179902 

Hs.88914 

Hs.165066 



Hs.111114 

Hs.75928 
Hs.251962 
Hs.79440 
Hs.75187 
Hs.65300 
Hs.18190 
Hs.76591 



1.579 
1.577 
1.575 
1.575 
1.575 
1.575 
1.575 
1.574 
1.573 
1.573 
1.571 
1.571 
1.571 



1.566 
1.566 
1.566 



1.658 
1.557 
1.557 
1.553 
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106450 
104120 
100533 
130364 
127122 



102273 
121875 
114366 
132944 
111199 
113494 
129515 



3 F09684 
9 AA1 87938 
8 M443277 



3 L07044 
1 W60186 
3 W24957 

3 AA088851 
AA449469 
AA429838 



Hs.24792 

Hs.55189 

Hs.31034 

Hs.11171 

Hs.77823 

Hs.142607 

Hs.22265 

Hs.155691 

Hs.17207 

Hs.23165 

Hs.144260 

Hs.188006 

Hs.8768 

Hs.250857 

Hs.1 69487 

Hs.16281 

Hs.75744 
Hs.11859 
Hs.89519 



ESTs; Weakly similar to ORF YOR283w [S-cerevisiaa] 
ESTs; Weakly similar to F25B5.3 [C.elegans] 
or11A 



APG5 (autophagy 5; S. cerevisiae)-like 

ESTs 

ESTs 

ESTs 

pre-B-cell leukemia transcription (actor 1 

ESTs 

ESTs 

EST 

KIAA0878 protein 

ESTs; Wkly smlrto II ALU SUBFAMILY SQ WARNING ENTRY !! [Ksapiens] 
Homo sapiens calcium/calmodulln-dependent protein kinase II mRNA; partial cd 
Kreisler (mouse) maf-related leucine zipper homolog 
ESTs; Moderately similar to similar to C.elegans protein 
encoded in cosmid T20D3 [H.sapiensJ 
S-adenosylmethionine decarboxylase 1 



1.547 
1.546 
1.546 



t AA156049 

104785 AA027163 

105595 AA279408 

130198 U67156 

114297 Z40758 

112876 T03488 

127500 AA525014 

120519 AA258585 

119859 W80702 



130455 X17059 

125181 W58461 

127093 AA768241 

132156 AA157401 



AA101811 

AA219172 

X94703 

N77151 

H88798 

H22985 

AA1 15629 

H78003 

AA481414 

D80783 



128510 
132968 
117035 
116781 
108677 
130214 
134700 
116618 



KIAA1046 protein 
Ras-Like Protein Tc10 
ESTs 
ESTs 
Hs.8087 ESTs 
Hs.44625 ESTs 
Hs.43691 ESTs 
Hs.1 00445 ESTs 

Hs.75981 ublquitin specific protease 14 (tRNA-guanine transglycosylase) 
Hs.98510 ESTs 

Hs.469 succinate dehydrogenase complex; subunit A; flavoproteln (Fp) 
Hs.6127 ESTs; Weakly similar to prostate-specific transglutaminase [H.sapiens] 
Hs.29822 ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

mitogen-activated protein kinase kinase kinase 5 
DKFZP434K151 protein 
ESTs 
ESTs 

cadherin 19 (NOTE: redefinition of symbol) 
ESTs 

cytochrome P450; subfamily I (aromatic compound-indudble); polypeptide 2 
ESTs; Weakly similar to Su(P) [D.melanogaster] 



AA279153 Hs.190049 



Z41747 

AA054515 

N68210 



Hs.1 12227 
Hs.65490 
Hs.7942 
Hs.25866 
Hs.1 51 988 
Hs.1 73091 
Hs.4842 
Hs.162115 
Hs.129887 
Hs.58461 
Hs.1361 
Hs.42148 
Hs.210115 
Hs.166459 
Hs.134173 
Hs.155956 
Hs.12396 



Hs.4113 
Hs.1 07295 
Hs.5518 
Hs.41840 
Hs.14480 
Hs.69506 
Hs.86849 
Hs.100816 
Hs.61638 
Hs.41182 
Hs.52132 



ESTs 

N-acetyltransferase 1 (arylamine N-acetyltransferase) 
ESTs 

oa72d02.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone 
IMAGE:1317795 3', mRNA sequence. 
S-adenosylhomocysteine hydrolase-like 1 
ESTs 

Homo sapiens mRNA; cDNA DKFZp566J2146 (from clone DKFZp566J2146) 

ESTs 

ESTs 

ESTs 



RAB28; member RAS oncogene family 
myosin X 
ESTs 
ESTs 
ESTs 
Hs.15266 ESTs 

Hs.8868 golgi SNAP receptor complex member 1 
Hs.45224 ESTs 

tumor necrosis factor receptor superfamily; member 10b 
Hs.1 18797 tbiquitin-conjugating enzyme E2D3 (homologous to yeast UBC4/5) 
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1.519 
1.518 
1.518 
1.518 
1.517 
1.515 
1.515 
1.513 
1.513 
1.513 
1.512 
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Hs.8888 ESTs 

Hs.469 succinate dehydrogenase complex; subunit A; flavoprotein (Fp) 



126802 AA947601 
128661 R82837 
134194 AA233231 
108953 AA1 49652 
133240 D31161 
132671 X76302 
132609 Z48923 
105574 AA278678 
113718 T97782 
127824 AI208365 
130132 U55936 
127394 AA453224 
100485 HG1111-HT11 
101078 L04510 



Hs.97056 ESTs 
Hs.103329 KIM0970 protein 
Hs.79828 ESTs 
Hs.42128 ESTs 
Hs.68613 ESTs 

Hs.54649 putative nucleic acid binding protein RY-1 

Hs.53250 bone moiphogenetic protein receptor; type II (serine/threonine kinase) 

Hs.258567 ESTs 

Hs.256268 ESTs 

Hs.127811 ESTs 

Hs.1 84376 synaptosomal-associated protein; 23kD 

ESTs; Weakly similar to !! ALU SUBFAMILY J WARNING ENTRY !! [H.sapiens] 
11 Ras-Like Protein Tc21 

ADP-ribosylation factor domain protein 1; 64kD 



1.507 
1.507 
1.507 
1.506 



128611 AA456845 Hs.102471 KIAA0680 gene product 
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TABLE 12A shows the accession numbers for those primekeys lacking unigeneED's for 
Table 12. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey CAT number Accession 



108536 119811J 
117040 46956J 
20 100782 18457J 



35 102313 27608J 



50 

124704 292319J 

116988 185904J 

124825 330773J 

55 110455 46874J 

126257 182217 1 

125624 154135J 

104038 264235J 

103427 43892J 



AW9706QOAA5Q3323H89218AF086031 H89112 

AA355435 NM_001516 Z30093T28405 AW949486 AA461142 AA410532 AI652073 AA521208 AI970141 AI968234 AI026102 
AA713583 AW1 35876 AA936614 M770300 AI242635 AA377033 AW960263 AW607683 AI273603 AA41 0287 AI040513 
AA460838 AIB03916 AW294095 AW449680 AW798677 AW675048 BE5421 16 AL120521 

L34840 NM_003241 U31905 AI546931 AI791616 AI973065 AI792321 AI546937 AI685880 AI732835 AI682360 AA420653 
AA5B4047 AI682323 AI824614 AI659889 AI680052 AI970887 AI623108 AA420692 AI418074 AA631018 A181059S AW291463 



AI393237 AI521 31 7 AI761 348 AF025841 D43968 AW994987 L34598 AF025841 D89789 D89788 D89790 AW998932 
AI971742 AI310238 X90976 AW139668 AW674280 AI365552 AA877452 AV657554 C75229 AA376077 AI798056 AW609213 
W25586 H30149 BE075089 BE075190 AW580858 H99598 M425238 AA133916 AW363478 BE158121 BE158127 
AW467960 BE158135 BE158126 BE158145 N92860 AA847246 AI961688 AI3S1423 M878154 AA043767 AI863712 
AI559226 AW339007 AI371266 AI368901 AA046624 AA134739 AW449154 AA130232 AI458720 M96251 1 AI700627 
R70437 AW004008 M045229 AI671572 H99599 AA043768 AI685454 AI871685 N29937 X90977 AA524240 AI142114 
AI825750 AI567805 AI631365 AI347893 AA134740 F20669 AA046707 AW793216 AW963298 AW959380 AA363265 
AI784593AI268201 R69451 AV657618 AI695588 

BE312163 AJ230798 AA374482 AI926059 AA622653 AI860704 BE139185 AW296884 T60238 T60120 
U33921 AI190489AA573311 

AI814663 AA806761 AA765241 AA019317 AA092255 AA035405 T85079 AA890151 AI373959 T85080 BE153728 AA740848 
BE080682AL048137AW182316AI699468AW274481 AW407538 AA306562 AW950024 AW949943AL045703AW843196 
W25132 BE612794 AA304266 AW958054 H25673 AV646563 AV646573 BE172990 AW593488 AA385181 AA164998 
AI246476 AA345406 AI277554 AA134749 AA856624 BE613247 AA29S003 AL048138 AA028121 T92510 AI923835 
AW020440AI401594AI889401 N93290 M044247 AA028100 AI582845 AA811151 AI741811 AI925878 AA448277 M172221 
AI214783 BE220793 M022746 AI082882 AA022849 AI928385 AA573472 A1420686 AW072902 AI799493 AI873506 
AI468977 AI1 92079 AI468976 AA044272 AW015701 AW316979 AA933042 AA609017 AI318393 AI424571 AI934945 
AA172023 AW050917 AA846180 AA134748 AI003947 AI766769 AW006697 AA653517 AW575680 AI474214 AA401478 
U36922 M927064 M868000 D62654 T91745 AW500202 M194764 AA746346 AA1 30464 AW1 17498 AA054526 N26432 
H02534 H04964 AW303367 BE300931 AI218049 AI208073 AW1 82749 AA983630 AI147585 AA1 94765 AA054534AA922720 
AI4365B5 AI346535 AA134269 AA280923 M897422 AA019559 AW274010 AA035406 AA917879 H99327 W32908 AI216046 
AW496823 AA019414 H82288 W35284 AI936621 AI7671 13 AA866177 AW367874 H82398 AF032885 AW300151 AW467069 
AA809346 AI188507 AI494178 AA872752 AI631631 U02310 NM.002015 AA815006 AI382453 AW197658 AI761654 
AI804396 AI382221 AI813540 AI439535 AI52390J AW517242 AI221705 AW298104 AW204560 AW573095 AW028783 
AW014650 AI766744 AI808294 AI698758 AIO41809 AI766667 AI479103 AA872797 AA769305 M765080 AA334166 
AI472322 
R07335 R07640 

AW953679 AW953680 AA244436 H82527 AA361046 AA244483 H82526 
AA501669 R52088 
H52576AF085971 H52172 

N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 

AW968363 AA465492 R34539 AA165411 

AA374532AA421255 

BE514383 AA071273 AW247987 AW673286 BE312102 AW749824 BE071985 AW577383 BE071945 BE072005 AW577355 
BE071965 AW239231 BE072000 BE071960 AW577360 AW749830 AW373020 X97303 AW999522 BE0001 92 BE562219 
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135197 2 
127394 3 



100485 3 

108345 1 
100522 1 



AW271838 AL1 33605 C01646 H29959 AA999896 D60676 AW999454 AW961 1 76 M315244 H14437 AW3861 18 N46512 
AW272021 AI768516 BE466421 AI082809 AI804454 AA905101 AW173368 N38942 AW614169 AI080483 N29489 AI500550 
AA994475 AA614464 AA707368 AA593145 AA569473 AW627815 AI828244 N63226 N42300 

NM_016353 AB023584 W44753 R09585 AA382865 R23772 AI814257 AA974046 AK001 608 AI935638 AW44O609 AI420022 
AA777386 AA806969 AI554876 AI584006 AI688556 AI688634 AI697997 AI014540 AI806683 AI741202 AW263154 
AW297238 AI149951 AI589076 AW082158 AW614265 AA931887 AA781969 R09490 AA484643 AI207121 AI088390 



AI244025 AI222558 W38425 AW473630 AI624599 AI921226 AI683152 AI096458 AI123822 AW170802 C16447 AI337674 
D25726 AW339366 AW771259 AA461 174 
I 1589048J H48372 W01626 



110924 6443J AW058463 AF195766 AA680145 T86901 W60373 W60281 NM 007222 AF1 06862 AI000795AA1 671 88 
AW884503 AW891313 AW891332 AW891312 AI984924 AI123518 N75170 AA131614 H25330 AI913358 AI742277 W25576 
R58771 AW445159 AW888628 AW888627 AW274674 AI08B482 N52314 N34282 AW001769 AI338943 T66784 AI288963 
AW46B676 AW237528 H25289 N71690 AA610128 AI143458 AI082599 N49144 AA854773 AW66341 1 AVV610151 N47938 
AW601626 AA1 67189 M918304 AA305205 BE069496 AA652836 BE069499 AI699298 AW249926 AW88B57B BE567635 
T10726 AW604715 D54245 D53062 D55610 D55555 AA301376 AI133498 N77788 AI936320 AW090734 AI269977 N50828 
AA550814 AI421993 AI005384 N5Q813 D60292 D59349 AA131710 D81698 D81B99 
AA331 156 AA331 157 AA331 155 

U76456 NM_O03256 AF057532 AA193414 AW293304 AW963378 M313095 AI359841 AI969312 AI080163 AW448926 
A1671 136 BE466399 AI637967 AI671873 AW196583 AW071635 AI634427 AW296872 AW292470 AA193650 
BE161832 AA453224 AA485772 



AA524886 AW971347 AA21 1537 

AW971327 AA5249B8 AW628653 AA251797 

AW976793AA769520 

AA432071 AA405648 AW000908 T16347 

AB028957 AL120001 AI267678 H10928 R19844 AW970334 AA393182 F05472 F1 1711 H09908 N50250 AI81541 1 BE463679 
D61468 AW970253 D60889 C15548 D61011 D60867 AI815795 AA534B31 D81386 AW235039 AI382158 D81174 M416899 
AA852310 H09789 H10929 H09813 F09369 R44721 D51515 Z38456 R14004T66255 F12148 F12139 AW351702 M85350 
AI01 8713 AW972450 AW972645 AA51 4964 T65172 F09785 F09776 AA436608 T05327 T071 1 8 AA339352 
AW301 608 N46706 AA649093 AA287595 AW81 1753 AA287596 N39260 

NM_001874 J04970 T91426 AW205201 T84979 AA255727 AA847837 R02164T91339 AV651884 AV651835 AV651350 
AV6501 18 AV651338 AI272002 AI367796 AA830651 AA2621 12 AW151 198 

AU076696 AA219720 AL135197 AA305877 N56376 AA318063 AA130725 AW954903 BE541230 AW383312 U86753 D85423 
AI679458 AI122932 AB007892 AI583919 BE160134 F08104 R34903 F13440 AA095444 AA262453 AA191036 R17895 
T81266 BE149776 AI279537 AI143113 AA361072 AW959030 AW268817 AAB1 1533 BE275179 AI221677 T65147 R49293 
AA249176 BE000290 AA768053 F09494 BE092645 BE172099 Z41 177 AA044750 AI909768 BE140795 BE140574 AW845210 
AW752452 BE243244 AA843664 AI3000B0 BE1 69032 AW1 89979 BE004869 AA621872 AI951772 AI678897 AI926598 
N62813 AI350912 AW608791 AI309602 AI983138 AW875592 AI655073 AW875626 AA130606 AI370827 C75528 C75554 
AW263335 AI344426 BE004788 AA576220 AA604824 AI431405 AA749378 R388B2 AW955075 AA173821 C75657 
AA219672AW768408 R43141AI431414M483343AI673792T17294AW770187 N74285AI476404AI0882B8AA654152 
AW974864 BE617311 BE243328 BE168049 

U64675 AW167507 AW167508 BE218568 AA779360 W85722 AL044843 BE159404 AF0120B6 AW89B61 1 AW898610 
BE159405 BE092191 AW890826AW369841 AW368064AW606702 AL044731 R82691 AA41 9346 AA41 6558 H96045 
AL040450 AI64Q531 AI808434 AUM6613 AWB55784 AW362469 AL048881 AL049015 AA094272 AA888903 AA417294 
AW237786 R59793 AL044916 DB2402 AI216854 AI079342 H96406 AL037845 AI915900 AA972133 AI478783 T31074 
Z21 135 221396 AA352182 R13918 AA430178 C17811 A1371824 AI742256 AA926801 N79156 AA35O310 AA081971 N83639 
R35544 AA312292 AW952080 N42322 M171957 AA565297 R89207 AA504106 AI630782 AAB26482 AI301579 T36241 
AW966618 Z28426 AL043480 AI124636 AA393449T19504 AW887823 AI289814 N53979 AL043571 AI632764 AI859613 
AI986308 AI683212 AI984499 AI133258 C05898 AW512761 AI041260 BE466240 Z19161 AI351 190 N67549 AI373374 
AA400873 AW440914 AW51 4879 AA770146 A1358754 R51 1 13 AI283773 AA649886 T30543 D54358 R37750 T03358 
T15451 T15880 AA999689 N67396 AI056289 T85597 N62441 R89099 R00035 T85596 R61335 R00128 N63359 AI535964 
AI207768 M31468 NM_012250 W01322 AA253280 AA253233 AA293143 AW582106 R79880 AA459547 AA363459 
AA234396 N31669 H44468 AA434587 AW363088 AW993541 



118250 
10367B 
119400 



14745.3 
genbank_N62602 
entre?_Z84483 
gBnbank_T92767 
entre*_W38197 



X51 501 NMJW2652 Y101 79 J03460 AI791618 AI821473 AA916588 AA564296 AA9161 10 AI972286 AI420470 AI568790 
AI597724 AW205207 AI659305 AI791620 AA532383 AI821475 AA526498 

NMJ12249 M31470 AL043108 AA262561 AA1788B3T29433 AA313329 W48807 AW404323 AA453560 AW403227 H94816 
W17101 M165152 W23989 AA091310 

AL121734 D54896 AA424269 BE242906 AA3621 18 BE018454 AI280348 AL048769 M35543 AA757734 AI128865 H20289 
H23728 AI203445 H41481 H18237 H44081 H92839 AI928621 H75675 D51 148 AI796198 AW390453 D55579 D54145 D53996 
D54015 R37664 H17541 AA668681 T65061 R15867 AW468123 R16049 H69030 M054226 H16070 F09655 R92144 T03521 
R05473 H92840 M018186 R91707 
U35637AA112989Z19308 
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5 



TABLE 13: shows genes, including expression sequence tags, up-regulated in prostate tumor 
tissue compared to normal prostate tissue as analyzed using Affymetrix/Eos Hu02 GeneChip 
array. Shown are the ratios of "average" normal prostate to "average" prostate cancer tissues. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession nui ' " ' ' 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 ; Background subtracted normal prostata : prostate tumor tissue 



Pkey ExAccn 


UnigeneIC 


i Unigene Title 


R1 


333516 




CH22_FGENES.173_1 


0.028 


337954 




CH22_EM:AC005500.GENSCAN.96-3 


0.029 


332496 R73299 


Hs.204354 


ras homotag gene family; member B 


0.03 


337944 




CH22_EM:AC00550O.GENSCAN.89-7 


0.033 


334111 




CH22_FGENES.330 10 


0.033 


333S57 




CH22_FGENES.241_2 


0.034 - 






CH.04_hsgi|6525284 


0.034 


336355 




CH22_FGENES.817_5 


0.035 


322011 AL137354 




EST cluster (not In UniGene) 


0.035 


336377 




CH22_FGENES.821_5 


0.036 


300254 AW079507 


Hs.188417 ESTs; Weakly similar to ZnT-3 [H.sapiens] 


0.037 


330096 




CH.19J32 gi|6015278 


0.037 


335191 




CH22 FGENES.507 6 


0.038 


334040 




CH22_FGENES.322_8 


0.039 


333586 




CH22_FGENES.204_2 


0.04 


333295 




CH22_FGENES.132 2 


0.042 


313326 AI088120 


Hs.122329 


ESTs 


0.043 






CH.10_p2gi|3983513 


0.043 






CH22 FGENES.144 21 


0.043 






CH22 FGENES.513 11 


0.044 






CH22 F6ENES.S52 11 


0.045 


333637 




CH22 FGENES.229 2 


0.046 


334582 




CH22_FGENES.407_5 


0.046 






CH22 FGENES.826 4 


0.047 


337461 




CH22_FGENES.782-1 


0.047 


302892 N58545 


Hs.6975 


histone deacetylase 3 


0.049 


338689 




CH22 EM:AC005500.GENSCAN.475-3 


0.049 


334721 




CH22_FGENES.421.32 


0.049 


305867 AA864572 




EST singleton (not in UniGene) with exon hit 


0.049 


335498 




CH22 FGENES.571 7 


0.05 


311596 AI682088 


Hs.223368 


ESTs 


0.05 


326959 




CH21_hsgi|6469836 


0.051 


311688 AW025661 


Hs.240090 ESTs 


0.052 


317298 AI922374 


Hs.158549 ESTs 


0.052 


332984 




CH22 FGENES.54 6 


0.052 


321039 AW247083 




EST cluster (not in UniGene) 


0.053 


335844 




CH22_FGENES.623_4 


0.053 


325371 




CH.12_hsgi|5866920 


0.054 


335667 




CH22.FGENES.590J 8 


0,054 


333635 




CH22_FGENES.228_2 


0.054 


336736 




CH22 FGENES.110-2 


0.055 


335893 




CH22_FGENES.S35_1 


0.055 


333170 




CH22_F6ENES.94 5 


0.055 


329768 




CH.14_p2 gi|6015501 


0.055 


334030 




CH22 FGENES.320 2 


0.055 


323359 AA234172 


Hs.137418 ESTs 


0.055 


300453 AW051431 


Hs.113029 


ribosomal protein S25 


0.055 


334262 




CH22_FGENES.367_12 


0.055 


306590 AI000246 




EST singleton (not in UniGene) with exon hit 


0.055 


331087 R22520 


Hs.23398 


ESTs 


0.055 


338620 




CH22_EM:AC005500.GENSCAN.450-18 


0.056 


339045 




CH22 DA59H18.GENSCAN.28-5 


0.056 


308023 AI452732 




EST singleton (not in UniGene) with exon hit 


0.057 
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339067 CH22_DA59H18.GENSCAN.33-3 0.057 

335689 CH22_FGENES.596_4 0.057 

339069 CH22 DA59H18.GENSCAN.33-5 0.057 

338176 CH22_EM;AC005500.GENSCAN.219-4 0.057 

5 328159 CH.06_hsgi|5868065 0.058 

335655 CH22_FGENES.590_6 0.058 

336371 CH22_FGENES.820_1 0.058 

336558 CH22_FGENES.842_3 0.059 

337738 CH22_EM:AC000097.GENSCAN.100-4 0.059 

10 334273 CH22_FGENES.369_2 0.059 

335889 CH22_FGENES.633_3 0.059 

327807 CH.05_hsgi|5867968 0.059 

333315 CH22_FGENES.138 7 0.059 

338825 CH22_DJ246D7.GENSCAN.4-6 0.06 

15 337612 CH22_C20H12.GENSCAN.22-5 0.06 

333897 CH22_FGENES.293_4 0.06 

335990 CH22_FGENES.655_4 0.06 

334264 CH22_FGENES.367J5 0.06 

338653 CH22_EM:AC005500.GENSCAN.460-39 0.061 

20 322303 W07459 EST cluster (not in UniGene) 0.061 

333498 CH22_FGENES.168 8 0.061 

336522 CH22_FGENES.839_3 0.061 

301357 AW295677 Hs.137840 ESTs; Moderately similar to HOMEOBOX 

PROTEIN SIX1 [Rsapiens] 0.062 

25 305917 AA876469 Hs.181357 laminin receptor 1 (67kD; ribosomal protein SA) 0.062 

336143 CH22_FGENES.705_5 0.063 

333493 CH22_FGENES.168_2 0.063 

332533 M99487 Hs.1915 folate hydrolase (prostate-specific membrane antigen) 1 0.063 

325844 CH.16_hsgi|6552453 ' 0.063 

30 336402 CH22_FGENES.823_17 0.063 

335767 CH22_FGENES.607_1 0.064 

301893 T80334 EST cluster (not in UniGene) with exon hit 0.064 

324019 AW1 77009 EST cluster (not in UniGene) 0.064 

305801 AA845997 EST singleton (not in UniGene) with exon hit 0.064 

35 335188 CH22_FGENES.507_3 0.065 

337533 CH22_FGENES.828-2 0.065 

333311 CH22_FGENES.138_3 0.065 

335668 CH22_FGENES.590_19 0.065 

306786 AI041589 EST singleton (not in UniGene) with exon hit 0.066 

40 306365 AA962086 EST singleton (not in UniGene) with exon hit 0.066 

306249 AA933840 EST singleton (not in UniGene) with exon hit 0.066 

335018 CH22_FGENES.474_6 0.066 

333594 CH22_FGENES.210_3 0.066 

333900 CH22_FGENES.293_7 0.066 

45 325207 CH.10_hsgi|6552430 0.067 

329888 CH.15_p2gi|6067149 0.067 

326238 CH.17_hsgi|5867260 0.067 

333658 CH22_FGENES.241_4 0.067 

335809 CH22_FGENES.617_6 0.068 

50 307427 AI243437 EST singleton (not in UniGene) with exon hit 0.068 

318428 AI949409 Hs.224583 ESTs 0.069 

327005 CH.21_hsgi|5867664 0.069 

330463 HG998-HT998 Sulfotransferase, Phenol-Preferring " 0.069 

333318 CH22_FGENES.138_10 0.07 

55 333313 CH22_FGENES.138_5 0.07 

325937 CH.16_hsgi|5867132 0.07 

335663 CH22_FGENES.590_14 0.07 

335349 CH22_FGENES.539_2 0.07 

303396 AA224470 Hs.25426 ESTs; Weakly similar to unknown [H.sapiens] 0.07 

60 332603 N66681 Hs.33470 ESTs 0.07 

333310 CH22_FGENES.138_2 0.071 

309924 AW340812 EST singleton (not in UniGene) with exon hit 0.071 

336340 CH22_FGENES.814_15 0.071 

308025 A1453365 Hs.172928 collagen; type I; alpha 1 0.071 

65 306805 AI055966 EST singleton (not in UniGene) with exon hit 0.071 

335499 CH22_FGENES.571_8 0.071 

329669 CH.14_p2gi|6272129 0.071 

321666 D28390 EST cluster (not in UniGene) 0.071 

338174 CH22_EM:AC005500.GENSCAN.219-2 0.072 



231 



WO 02/30268 



PCT/US01/32045 



3322S0 N62712 
328506 

331756 AA291468 

317729 AA971718 

304515 AA458708 

313644 AI565766 
326145 
336394 

306516 AA989542 

300629 AA152119 



330513 M81057 
308905 AI359636 
337419 



305830 AA857665 
302928 AL1 37719 
304321 M1 36698 



305060 AA635771 
306051 AA905130 
308289 AI571211 



CH22_FGENES.842_1 0.072 

Hs.140 immunoglobulin gamma 3 (Gm marker) 0.072 

CH22_FGENES.46-1 0.072 

CH.21_hsgi|6004446 0.073 

CH22_FGENES.303_1 0.074 

CH22_FGENES.104_5 0.074 
Hs.174007 ESTs; Modarately similar to 111! ALU SUBFAMILY J WARNING 0.074 

CH22_DA59H18.GENSCAN.44-9 0.074 

CH.06_hsgi|5868031 0.075 
Hs.226223 KIAA0618 gene product 

CH.07_hsgi|5863471 
Hs.98504 ESTs 

CH22_FGENES.507_8 
Hs.128141 ESTs 
Hs.251577 hemoglobin; alpha 2 
Hs.124960 ESTs 

CH.17_hsgi|5867204 
CH22_FGENES.823_6 
EST singleton (not in UniGene) with ex 



0.075 
0.075 
0.076 
0.076 
0.076 
0.076 
0.076 
0.077 
0.077 

Hs.155101 ATP synthase; H+ transporting; mitochondrial F1 complex; alpha subunil; 



isoforml; cardiac muscle 



EST singleton (not in UniGene) with exon hit 
Hs.193689 ESTs 

CH22_FGENES.3_2 
Hs.180884 carboxypeptidase Bt (tissue) 
Hs.8102 ribosomal protein S20 

CH22_FGENES.759-4 

CH22_FGENES.157_8 

CH22_FGENES.440_3 

CH.xThsgi|5868569 



EST singleton (not in UniGene) with exon hit 
EST cluster (not in UniGene) with exon hit 
Hs.1 13029 ribosomal protein S25 
CH.19_hsgi|5867340 
CH22_FGENES.514_2 
CH22_FGENES.412_6 



EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 



Human unidentified gene complementary to P450c21 
gene; partial cds 

CH22_EM:AC005500.GENSCAN.13-18 



Hs.253100 EST 

CH22_FGENES.75_7 

CH22_EM:AC005500.GENSCAN.477-25 
Hs.97312 ESTs 

CH.17_hsgi|5867203 

CH.07_hsgi|6004478 
Hs.165439 arsA (bacterial) arsanite transporter; ATP-binding; homolog 1 

CH22_FGENES.373_8 



331809 AA402482 

326138 

328304 

330570 U60276 



325839 CH.16 hs gi|6552452 

333531 CH22_FGENES.175_18 

330385 AA449749 Hs.31386 ESTs; Highly similar to secreted apoptosis related protein 
323305 AA811351 



0.077 

0.077^ 

0.077 

0.077 

0.077 



0.078 
0.078 
0.079 



0.081 
0.081 
0.081 
0.081 



0.083 
0.083 
0.083 
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CH22_FGENES.633_2 






306008 AA894390 


EST singleton (not in UniGens) with axon hit 








CH22_FGENES.365_15 






318303 AW451197 


Hs.1 13418 ESTs 








CH.02_p2 gi|6648220 








CH22_FGENES.41-1 






320506 AI815668 


Hs.157476 sucl -associated neurotrophic factor target 2 








(FGFR signalling adaptor) 






316974 AI740721 


Hs. 128292 ESTs 








CH22 FGENES.832_9 






335750 


CH22 FGENES.6Q2 4 


0.085 




335676 


CH22_FGENES.594_1 


0.086 










15 


310932 AI933861 


Hs, 222852 ESTs 






CH22_FGEN ES.502_4 








CH22_FGENES.373„9 








CH22_FGENES.433_5 








CH22_FGEN ES.301 _2 








CH22_FGENES.823_35 








CH22_FGENES.272_6 






33i ? 9 r 


CH22_FGENES.652_4 






301737 AI815981 


EST cluster (not in UniGene) with exon hit 


n'fW7 






CH22_DA59H 1 8.GENSCAN.44-6 




25 


326241 


CH.04_hs gi[65252B3 
CH.17_hs gl|58S7260 


n'niin 






CH22_EM"ACOO550O.GENSCAN .331-4 








CH.05_hs gi|5867961 






305266 M679772 


EST singleton (not in UniGene) with exon hit 






334359 


CH22_FGENES.378_4 






CH22_FGENES.571_10 


n'raa 




329687 


CH.14_p2gi|6117856 








CH22_FGEN ES.240_2 






324430 AA464018 


EST cluster (not in UniGene) 






325999 


CH.16_hs gi|5867073 




334832 


CH22_FGENES.439_1 








CH22_DA59H18.GENSCAN.49-3 


nnflQ 




300896 AI916902 


Hs.2 13882 ESTs 








CH.07_hs gi|5868309 




4U 




CH22_FGENES.480_1 






CH.14_p2 gi[6469354 






333656 


CH22_FGENES.240_4 






326180 


CH.17_hsgi|5867211 








CH22_FGENES.144_6 








CH22_EM:AC0055Q0.GENSCAN .306-3 




305396 M721052 


EST singleton (not in UniGene) with exon hit 








CH22_FGEN ES.795-7 






326424 


CH.19_hsgi|E867369 






306454 AA977992 


EST singleton (not in UniGene) with exon hit 


nno 


50 




CH22_DJ32l10.GENSCAN.7-6 




327470 


CH.02_hs gi|5867772 






333165 


CH22 FGENES.91 7 


0.O9 




307155 AI186738 


Hs.1 82426 ribosomal protein S2 


0.09 




330717 AA233926 


Hs.23635 ESTs 






335334 


CH22_FGEN ES.535J 0 








CH22_FGENES.636_2 


nno 






CH22 FGENES.292_7 


0.09 




331034 N51868 


Hs.31965 ESTs; Moderately similar to 40S RIBOSOMAL 








PROTEIN S20 [H.sapiens] 


0.09 


60 


304660 AA534416 


Hs.162185 ESTs 


0.09 


328217 


CH.06_hsgi|E868096 


0.091 




336068 


CH22 FGENES.684J3 


0.091 




302833 AA295381 


Hs.44423 ESTs 


0.091 




328668 


CH.07 hsgi|5868254 
CH22 FGENES.532_2 


0.091 
0.091 


65 


335309 
338481 


CH22_EM:AC005500.GENSCAN.377-5 


0.091 




306286 M936892 


EST singleton (not in UniGene) with exon hit 


0.091 




305070 AA639783 


EST singleton (not In UniGene) with exon hit 


0.091 




304870 AA594811 


Hs.1 19122 ribosomal protein L13a 


0.091 




303856 M968589 


Hs.944 glucose phosphate isomerase 


0.091 
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323789 AI459B12 


Hs.170460 ESTs; Weakly similar to KIAA0990 protein (H.sapiens] 


0.092 
0.092 


334910 
326332 


CH22_FGENES.455_3 
CH.19_hsgi|5867327 


0.092 


332467 AA489630 


Hs.1 19004 KIAA0665 gene product 


0.092 




CH22 EM:AC005500GENSCAN 402-7 






CH22„FGEN ES.829J5 






CH22_FGENES.250„24 






CH22 FGENES.842_4 




333230 


CH22_FGENES.107_10 


0^093 


333133 


CH22_FGENES.83_9 


0.093 


334885 


CH22_FGENES.451 Jl 1 




330605 X02419 


Hs.77274 plasminogen activator; urokinase 




336392 


CH22_FGENES.823_4 




334083 
325469 


CH22_FGENES.327_38 
CH.12_hs gi|6017034 
Hs.19039 ESTs 




331077 R09531 
303701 AW500732 


EST cluster (not in UniGene) with exon hit 


ntm 


334218 


CH22_FG EN ES.358_3 


0 093 


336542 


CH22„FGENES.840_6 




337151 


CH22_FG EN ES.546-1 






CH22~"fG EN ES.297-4 


0093 


334680 


CH22_FGENES.419_2 




326365 


CH.18_hs gi|5867297 




338952 




0093 




CH22_FGENES.832-4 




333546 


CH22_FGENES.180_2 




335258 


CH22_FGENES.518_3 


~™J 


336786 


CH22_FGENES.168-19 




321644 AI204177 


Hs.237396 ESTs 




335943 


CH22_FGENES.646_17 




327918 


CH.06_hs giI5868165 


0 094 


306398 AA970548 


EST singleton (not in UniGene) with exon hit 




335671 


CH22_FGENES.592_3 






CH22_FGENES.475_1 1 


O094 




CH22_EM:ACQ05500.GENSCAN.290-2 




332061 AA504812 


Hs.1 92824 early B-cell factor 




305153 AA654582 


Hs.77039 riboscmal protein S3A 


0094 




CH22_FGENES.292_2 


0^094 


323940 AI864428 


Hs.170880 ESTs 


0.094 


313779 AA648796 


Hs.129771 ESTs 


0.095 


323109 AA169345 


EST cluster (not in UniGene) 


0.095 




CH22_FGENES.38_4 






CH22_FG ENES.543_6 




303887 R72672 


Hs.193484 ESTs; Weakly similar to Similarity with yeast gene 






L3502.1 [Celegans] 






CH22_FGENES.727_3 


0095 


311280 AI767957 


Hs.1 97737 ESTs; Weakly similar to Y38A8.1 gene product [Celegans] 




337256 


CH22_FGENES.648-3 




308B14 AI819263 


EST singleton (not in UniGene) with exon hit 




334659 


CH22_FGENES.418_7 
CH22_FGENES.635_3 


0.095 


335895 

321697 AW388061 


Hs.4953 golgi autoantigan; golgin subfamily a; 3 


0.095 


336010 

302824 U21260 


CH22_FGENES.668_8 

EST cluster (not in UniGene) with exon hit 


0.096 


333612 


CH22_FGENES.217_7 


0.096 


304823 AA584837 


EST singleton (not in UniGene) with exon hit 


0.096 


335665 


CH22_FGENES.590_16 


0.096 


306518 AA989598 


EST singleton (not in UniGene) with exon hit 


0.096 


335243 


CH22 FGENES.516_4 


0.096 


335436 


CH22_FGENES.559_5 


0.096 


300243 AI420256 
332810 


Hs.161271 ESTs 


0.096 


CH22_FGENES.7_12 


0.097 


308612 AI735634 


EST singleton (not in UniGene) with exon hit 
CH22 FGENES.618.6 


0.097 
0.097 


335818 
325838 


CH.16_hsgi|6552452 


0.097 


337482 


CH22 FGENES.795-6 
CH22 FGENES.26-1 


0.097 
0.097 


336645 
337293 


CH22LFGENES.675-1 


0.098 
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329B93 


CH.15_p2gi|6525313 






326533 


CH.19_hsgi|5867441 






334905 


CH22_FGENES.452_20 






306347 AA961144 


EST singleton (not in UniGene) with exon hit 




J 




CH22_FGENES.43-4 








CH22_DA59H18.GENSCAN.69-7 






335774 


CH22_FGENES.607_10 






339216 


CH22_FF1 13D1 1 .GENSCAN.6-1 1 




in 


335311 


CH22_FGENES.532_4 
CH.11_p2 gi|6729060 


nrwr. 




328595 


CH.07_hsgi[5868224 






326928 


CH.21 hsgi|6456782 


0098 




315234 AI079680 


Hs.120770 ESTs 


o!(»8 


15 


306082 AA9O8508 


EST singleton (not in UniGene) with exon hit 


0.098 


305710 AA826544 


EST singleton (not in UniGene) with exon hit 






318540 T30280 


EST cluster (not in UniGene) 






337553 


CH22_C4G1 .GENSCAN.2-1 






320951 AA344069 


Hs.202699 neurexophilin 4 




2.0 


303B45 T08033 


EST cluster (not in UniGene) with exon hit 




338981 


CH22_DA59H18.GENSCAN.2-5 






321313 R87365 


Hs.26058 ESTs; Weakly similar to p532 [H.sapiens] 






328343 


CH.07_hs gi|5868383 


0.099 ^ 




332203 H49388 


Hs.102082 EST 




25 


301780 R07064 


EST cluster (not in UniGene) with exon hit 




332095 AA608838 


Hs.162681 EST 


nnqq 




316442 AA760894 


CH22_FGENES.107_5 
Hs.153023 ESTs 


0.099 




326001 


CH.16_hs gi|5867073 




30 


334363 


CH22_FGENES.378_11 


0.099 


338895 


CH22_DJ32i1 0.GENSCAN.9-2 






327460 


CH.02_hs gi[6004455 






332705 T59161 


Hs.76293 thymosin; beta 1 0 






307805 AI351739 


EST singleton (not in UniGene) with exon hit 




35 


322800 F25037 


Hs.225175 ESTs 




304918 AA602697 


EST singleton (not in UniGene) with exon hit 






334327 


CH22_FGENES.375_4 






318359 AI097439 


Hs.135548 ESTs 


ft] 




326644 


CH.20_hs gi|5867559 




40 


334454 


CH22_FGENES.388_3 




327959 


CH.06_hsgi|5868210 


0.1 




323783 AA330586 


Hs.131819 ESTs 






309198 AI955915 


Hs.248038 major histocompatibility complex; class I; C 






339265 


CH22_BA354l12.GENSCAN.10-3 


0.1 


45 


320576 AL049977 


Hs.162209 Homo sapiens mRNA; cDNA DKFZp564C122 






(from clone DKFZp564C122) 






338132 


CH22_EM:AC005500.GENSCAN .200-2 


0.1 




333163 


CH22_FGENES.91_5 






337584 


CH22_C20H1 2.GENSCAN.5-1 




50 


307588 AI285535 


EST singleton (not in UniGene) with exon hit 




336969 


CH22_FGENES.378-2 


0.101 




327535 


CH.02_hsgi|6525279 






328732 


CH.07_hs gi|5868289 






336686 


CH22_FGENES.46-3 




__ 


335777 


CH22_FGENES.607_13 




332944 


CH22_FGENES.47_3 


nini 




333174 


CH22_FGENES.95_1 








CH22 FGENES.821 8 


0.101 




330571 U60800 


Hs.79089 sema domain; immunoglobulin domain (Ig); 




60 




cytoplasmic domain; (semaphorin) 4D 


0.101 


331789 AA398721 


Hs.186749 ESTs 


0.101 




338915 


CH22 DJ32I10.GENSCAN.12-1 


0.101 




334844 


CH22 FGENES.439_24 


0.101 




336642 


CH22 FGENES.23-4 


0.101 


65 


334906 


CH22 FGENES.452_21 


0.101 


333188 


CH22 FGENES.98_8 


0.101 




300088 AW299993 


EST cluster (not in UniGene) with exon hit 


0.101 




329373 


CH.X hsgi|6682537 


0.102 




331120 R46576 


Hs.23239 ESTs 


0.102 




335856 


CH22.FGENES.628J 


0.102 
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331888 AA431337 


Hs.98017 












CH22_FGENES.89_4 






335989 




CH22_FGENES.655_2 






304385 AA235602 




EST singleton (not In UniGene) with exon hit 




- 

J 






CH22_EM:AC005500.GENSCAN. 133-1 


nin? 








CH22_FGENES.507J5 






318595 T39486 


Hs.6137 












CH22_FGENES.25Q_1 1 


n'irw 


10 


306526 AA989713 




EST singleton (not in UniGene) with exon hit 




328734 




CH.07_hs gi|5868289 






307294 AI205612 




ribosomal protein; largo; PO 


0103 








CH.02_hs gi|5867751 






335872 




CH22_FGENES.630_3 




15 






CH22_FGENES.189_1 


nim 


334774 




CH22_FGENES.430_6 










CH22_EM:AC00550O.GENSCAN.462-1 


0J03 








CH.20_hs gi|5867595 










CH22_FGENES.31Q_18 






335800 




CH22_FGENES.61 3_4 






318113 AI187943 










337278 




CH22_FGENES .665-1 






336386 




CH22_FGENES.822_6 






334790 




CH22_FGENES.432_15 






303778 AW505368 




EST cluster (not in UniGene) with exon hit 








CH22_FGENES.839_5 






328936 




CH.08_hs gi|5868500 






335102 




CH22_FGENES.494_7 


0.104 




300935 AA513644 


Hs.222815 


ESTs; Weakly similar to Wiskott-Aldrich Syndrome 




30 






protein [H.sapiens] 


307581 AI284415 




EST singleton (not in UniGene) with exon hit 






317301 AW291683 


Hs.226056 








335330 




CH22_FGENES.535_3 






337968 




CH22_EM:AC005500.6ENSCAN.103-2 






335627 




CH22_FGENES.584_7 


°'inl 


35 


336274 




CH22_FGENES.762_2 










CH22_FGEN ES.424_5 


nins 




334409 




CH22_FGENES.383_6 










CH.01„hs gi|5867544 


0105 








CH22 - FGENES.138_13 






303181 AA452366 




EST cluster (not in UniGene) with exon hit 










CH22_FGENES261_2 










CH22_EM:AC005500.GENSCAN .276-3 






334282 




CH22_FGENES.369_12 




45 


330190 




CH.05_p2gi|6165182 




310748 AW014249 


Hs.158698 




nirv; 




338150 




CH22 EM:AC005500.GENSCAN.207-2 






336719 




CH22_FGENES.82-6 






330228 




CH.05_p2gi|6013527 


nins 


50 


327801 




CH.05_hsgi|5867924 




330525 S75168 


Hs.274 


megakaryocyte-associated tyrosine kinase 










CH22_FGENES.468_2 


niriq 




335111 




CH22_FGENES.494_19 






334483 




CH22 FGENES.395_5 




55 


328829 




CH.07_hsgi|5868337 




302753 M74299 




EST cluster (not in UniGene) with exon hit 










CH22 FGENES.398J0 


0106 




330024 




CH.16_p2 gl|6671908 


0.106 




321030 AI769930 


Hs.233617 Homo sapiens (clone B3B3E13) Huntington's 










disease candidate region 


0.107 


60 


333410 




CH22 EM:AC0055Q0.GENSCAN.341-6 


0.107 




334353 




CH22_FGENES.376_5 


0.107 




338276 




CH22 EM.-ACOO550O.GENSCAN.288-9 


0.107 




329053 




CH.X_hsgi|5868574 


0.107 


65 


336560 




CH22 FGENES.842.5 


0.107 


332158 M621363 


Hs.1 12980 


EST 


0.107 




336447 




CH22_FGENES.829_4 


0.107 




333703 




CH22_FGENES550_17 


0.107 




326207 




CH.17_hsgi|5867222 


0.107 




333232 




CH22_FGENES.108_1 


0.107 
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CH22_FG ENES.435_1 




303784 AA704983 


EST cluster (not in UniGene) with exon hit 






CH22 DJ246D7.GENSCAN.10-2 




339407 


CH22_DJ579N16.GENSCAN.1-9 


o!t08 




CH22_C20H12.GENSCAN.32-8 




334650 


CH22_FGENES.417_17 


o!l08 


308511 AI687580 


EST singleton (not in UnlGene) with exon hit 




333392 


CH22_FG ENES. 1 44_8 




325840 


CH.16_hs gi|6552452 




315044 AW205664 


Hs.1 29568 ESTs 


0 108 


333298 


CH22_FGENES.133_4 


o!l08 




CH22_FGENES.501_7 






CH22_FGENES.137_2 


oinB 




CH.19_hs gi[5867327 






CH22_FGENES.482_1 




305185 AA663985 


Hs.248038 major histocompatibility complex; class I; C 
CH22_FGENES.590_9 


nira 


323040 AA336609 


Hs.10862 ESTs 

CH22__FGENES.699-6 


0 1 OB 




CH22_BA354l12.GENSCAN.9-6 




321202 H54052 


Hs.163639 ESTs; Weakly similar to INTERCELLULAR ADHESION 






MOLECULE-1 PRECURSOR [H.sapiens] 




331792 AA398968 


Hs.97548 EST 

CH22_FGENES.278_2 




321325 AB033100 


EST cluster (not in UniGene) 


0.109 


331373 AA435513 


Hs.1 78170 ESTs; Weakly similar to DUAL SPECIFICITY 






PROTEIN PHOSPHATASE 3 






CH.07_hS gi|5868309 




335105 


CH22_FGENES.494_I0 




300975 AI283548 


Hs. 149668 ESTs 


0.109 


324893 T31940 


EST cluster (not in UniGene) 


0.109 


333397 


CH22_FGENES.144_15 








nira 




CH22_FGENES.571_22 




336373 


CH22_FGENES.820_3 




336188 


CH22_FGENES.717_12 


0.109 


313455 AW081702 


Hs.137329 ESTs 






CH22_FGENES.506_4 




306814 AI066577 


EST singleton (not in UniGene) with exon hit 




311130 AI632322 


Hs.1 95306 ESTs 




310882 AW080339 


Hs.211911 ESTs 




323383 AI346359 


Hs.1 35209 ESTs 




300212 AW1 35925 


Hs.184552 biphenylhydrolase-lika (serine hydrolase; breast epithelial 
mucin-assoc. 


nil 




CH.14_hs gi|5867014 






CH.19_p2 gi|6015278 




331942 AA453261 


Hs.99309 ESTs 






CH22_FGENES.421_34 






CH22_FGENES.217_9 






CH22_FGENES.692-1 




305057 AA635626 


Hs.62954 ferritin; heavy polypeptide 1 






CH22 EM:AC005500.GENSCAN.480-3 


011 


335385 


CH22_FGENES.543_27 


0/11 


338012 


CH22_EM:AC005500.GENSCAN.128-10 


0.11 




CH.Y_hs gl|5868886 




338980 


CH22 DA59H1B.GENSCAN.2-4 




336553 


CH22 FGENES.841J0 


0.111 


330021 


CH.16_p2 gi[6671889 


0.111 


327579 


CH.03_hsgi|5867824 


0.111 


333099 


CH22_FGENES.79_4 


0.111 


337076 


CH22_FGENES.453-4 


0.111 


331388 AA456852 


Hs.43543 suppressor of white apricot homolog 2 


0.111 


306674 A1005542 


Hs.180414 heatshock70kD protein 10 (HSC71) 


0.111 


305949 AA884409 


EST singleton (not in UniGene) with exon hit 


0.111 


330748 AM19217 
333780 


Hs.15911 DKFZP586E1422 protein 
CH22 FGENES573_2 


0.111 
0.111 


323676 AI702835 


EST cluster (not in UniGene) 


0.111 


308952 AI868157 


Hs.224226 EST 


0.111 


309338 AW026946 


Hs.1 81 1 65 eukaryotic translation elongation factor 1 alpha 1 


0.111 
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CH.X hsgl|6381976 


0.112 
0.112 






CH22_FGENES.173_3 

EST singleton (not in UniGene) with exon hit 


0.112 






CH22_FGENES.728_2 


0.112 


5 




CH22 FGENES.250 12 


0.112 




302173 AI417947 


Hs.14068 ESTs 


0.112 






CH22 FGENES.571_25 


0.112 




328042 


CH.05 hsgi|5902482 


0.112 




336512 


CH22_FGENES.834_7 


0.112 


10 




CH.07 hs gi|5868486 


0.112 




311265 AW205118 


Hs.199214 ESTs 


0.112 




32321 B AF131846 


Hs.1 3396 Homo sapiens clone 25028 mRNA sequence 


0.112 




302002 AF013956 


Hs.1 23085 chromobox homolog 4 (Drosophila Po class) 


0.112 






Hs.1 52448 ESTs; Moderately similar to MULTIFUNCTIONAL PROTEIN ADE2 0.112 


15 


312531 AI937242 


Hs.1 76590 ESTs 






322246 AW384710 


Hs.1 25258 ESTs 








CH22_FGENES.241_5 








CH.02_hs gi[61 17815 








CH22_FGENES.839_1 




20 


338682 


CH22_EMAC005500.GENSCAN.472-1 








CH22_FGENES.398J 


0.113 




322533 T59538 


EST cluster (not in UniGene) 






306873 AI086929 


EST singleton (not in UniGene) with exon hit 


nil? 


25 




CH22_FGENES.679_2 




303898 T23215 


EST cluster (not in UniGene) with exon hit 








Hs.1 87226 ESTs 








CH22_FGENES.506_5 






333607 


CH22_FGENES.216_2 




30 


305549 AA773530 


EST singleton (not in UniGene) with exon hit 






CH22_FGENES.249_4 


0.113 






CH22_FGENES.376_3 








CH22 EM:AC0055DO.GENSCAN.233-18 








CH22_FGENES.206_2 


0.114 


35 




CH22_BA354l12.GENSCAN.2-3 






CH22_FGENES.777-1 






309101 AI925108 


EST singleton (not in UniGene) with exon hit 








CH.07_hs gi|5868477 






323999 AI537333 


Hs.252782 ESTs 








CH22_FGENES.173_2 




40 




CH.16_p2gi|61 65200 








CH.17_hsgi|5B67230 








CH22_FGENES.633_4 








CH22_FGENES.77-1 


0114 


45 




CH.04_hsgi|5B67890 


o!l14 


338842 


CH22 DJ246D7.GENSCAN.7-1 


0.114 




306534 AA991487 


EST singleton (not in UniGene) with exon hit 


0.114 






CH22_FGENES.266J 


0.114 






Hs.227150 Homo sapiens LSFR2 gene; last exon 


0.114 






Hs.5337 isocitrate dehydrogenase 2 (NADP+); mitochondrial 


0.114 


50 




Hs.77515 inositol 1;4;5-triphosphate receptor; type 3 


0.114 






CH.01 hsgi|6552411 


0.114 






Hs.1 95188 glyceraldehyde-3-phosphate dehydrogenase 


0.115 






CH.16_hsgi|5867087 


0.115 






CH22_FGENES272-11 


0.115 


55 


305858 AA363103 


EST singleton (not in UniGene) with exon hit 


0.115 






multiple UniGene matches 


0.115 




336158 


CH22_FGENES.707_2 


0.115 




327866 


CH.06 hsgi|5868131 


0.115 




339157 


CH22 DA59H18.GENSCAN.67-3 


0.115 


60 




CH22 BA354I12.GENSCAN.8-3 


0.115 




336129 


CH22_FGENES.701_17 


0.115 




333684 


CH22 FGENES.249_2 


0.115 




309618 AW190162 


Hs.1 84776 ribosomal protein L23a 


0.115 


65 


312926 AA954097 


Hs.127523 ESTs 


0.115 


302640 AB03569B 


EST cluster (not in UniGene) with exon hit 


0.115 




328968 


CH.08 hsgi|6456775 


0.115 




327902 


CH.06_hsgi|586B158 


0.115 




321927 AJ223366 


EST cluster (not in UniGene) 


0.115 




335962 


CH22 FGENES.651_4 


0.115 
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CH22 FGENES.460 1 




330535 U11872 


Human interleukin-8 receptor typo B (IL8RB) mRNA 






splice variant IL8RB1 






CH 07 hs gi|5868227 
CH22 FGENES.452 16 






CH 07 hs gi|5B684B2 












CH22 FGENES.819-1 


0116 


305079 AA641329 








CH 0l' hs gi|601 7017 


0116 


333918 


CH22rFGENES.296_7 


0/116 


333600 


CH22_FGENES.213_2 






CH22 FGENES 623 6 


0116 




CH22 FGENES 171 4 






CH 04 hs gi|5867872 






CH22~FGENES.161 6 






CH.20_hS gi|6552460 






CH 21 hs gi|5867664 


0117 




CH22 r-GENES.795-3 






CH22~FGENES.824 10 




321964 AL079687 


Hs. 171 065 ESTs 






CH22 FGENES 590 2 


0117 


308164 AI521574 


Hs.181165 eukaryotic translation elongation factor 1 alpha 1 


0117 


300341 H45095 






300154 AI245127 


Hs.179331 ESTs 




306295 AA937331 






329670 








CH22 FGENES 583 6 


0117 


307845 A1363450 






330401 D28383 








5 cap to the start codon) 






CH 21 hs gi|6682520 


0117 








331083 R17762 








CH X hs gi|6017060 






CH22_BA354l12.GENSCAN.27-3 




331974 AA464518 


Hs 99616 ESTs 


0117 




On^_fcM.Al»UU00UU.UfcNoWUN.40q-ii 






CH.06_p2 gi|2905881 






CH.Q9_p2 gi|3056622 




305377 AA715714 


Hs.181357 laminin receptor 1 (67kD; ribosomal protein SA) 






CH22 FGENES 79 12 


0117 




CH22_EM'.AG00550Q.GENSCAN .392-4 






CH 01 hs gi[5902477 


0117 


301970 AB028962 


Hs. 120245 KIAA1039 protein 










330612 X15673 


Hs.93174 Human endogenous retrovirus pnc.1 (cHV9) 












CH.06_hS gi|5902482 




329976 


CH.16_p2 gi|4878Q63 




320952 AA897432 


Hs.130411 ESTs 


0118 


305621 AA789095 








CH22_EM:AC005500.GENSOAN.34-3 




333626 


CH22_FGENES.224_2 










328803 


CH.07 hsgi|6004475 


0.118 


325922 


CH.16 hsgi|5867122 


0.118 


334489 


CH22 FGENES.397J 


0.118 


320638 R54766 


Hs.101120 ESTs 


0.118 


321932 AA569229 


EST cluster (not in UniGene) 


0.118 


336958 


CH22_FGENES.367-1 


0.118 


332082 AA600176 


Hs.1 12345 ESTs 


0.118 


306004 AA889992 


EST singleton (not in UniGene) with exon hit 


0.118 


336803 


CH22 FGENES.194-1 


0.118 


309107 AI925823 


EST singleton (not in UniGene) with exon hit 


0.118 


336859 


CH22 FGENES.293-9 


0.118 


337935 


CH22 EM:AC005500.GENSCAN.85-6 


0.118 


326492 


CH.19_hsgi|5867422 


0.118 



WO 02/30268 



PCT/US01/32045 




CH.16_p2gi|6671908 
CH.11_hsgi|5866878 
CH22_FGENES.523_7 
ESTS 

CH.X_hsgi|5868711 
ESTs 

CH22_EM:AC005500.GENSCAN.304-2 

CH.17_hsgi|5867178 

CH22_FGENES.426 3 

CH22_FGENES.73 5 

EST duster (not in UniGene) with exon hit 

collagen; type I; alpha 1 



0.119 
0.119 
0.119 
0.119 
0.119 
0.119 



EST singlston (not in UniGene) with exon hit 

CH22_FGENES.560_4 

CH22_FGENES.717_11 

EST singleton (not in UniGene) with exon hit 

CH22 FGENES.679 9 

EST singleton (not in UniGene) with exon hit 

EST 

EST singleton (not in UniGene) with exon hit 

CH22_FGENES.746_3 

EST cluster (not in UniGene) with exon hit 

CH22_FGENES.596_5 

CH.08 hsgi|6458765 



0.12 
0.12 
0.12 



323825 AI793080 



32G122 CH.17Jsgi|5867194 
335310 CH22_FGENES.532_3 
335453 CH22_FGENES.562_13 
305103 AA643329 Hs.111334 ferritin; light polypeptide 
337284 CH22 FGENES.667-2 

CH22_FGENES.758-4 
5 ESTs 

EST cluster (not in UniGene) wi 



337418 

313073 AI963740 

303759 AW504164 
300017 

M33197 

316725 AW135084 

330738 AA293153 



ESTs 

CH22 BA354I12.GENSCAN.16-6 0.12 
CH22_EM:AC005500.GENSCAN.116-3 0.12 
CH22_FGENES.137_1 0.121 
EST singleton (not in UniGene) with exon hit 0.121 
CH.X_nsgi|6381976 0.121 
multiple UniGene matches 0.121 
CH22 FGENES.129_2 0.121 
ESTs; Weakly similarto NEUTROPHIL GELATINASE-ASSOCIATED 
LIPOCALIN PRECURSOR [R.nowegicus] 0.121 
Rho guanine nucleotide exchange factor (GEF) 1 0.121 
eukaryotic translation elongation factor 1 alpha 1 0.121 
CH22_FGENES235_2 0.121 
EST cluster (not in UniGene) with exon hit 0.121 
CH.14_p2gi|6002090 0.121 
~' " 0.121 
0.121 
0.121 
0.122 
0.122 
0.122 
0.122 



60 315303 AA780564 Hs.189053 E 



AFFX control; GAPDH 
Hs.1 27264 ESTs 

nuclear receptor co-represser 2 
CH22_FGENES.829 25 
CH22_FGENES.647_3 



.14-3 

CH22_FGENES.466_2 
EST cluster (not in UniGene) 
CH22_DA59H18.GENSCAN.38-2 
CH22_EMAC005500.GENSCAN.306-2 



CH.12_hs #017034 

CH22J 

Hs.258712 EST 



0.122 
0.122 
0.122 
0.122 
0.122 
0.123 
0.123 
0.123 
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329446 CH.Y hsgl|5868886 0.123 

303326 AA229433 Hs.222634 ESTs7Moderately similar to ubiquitin-like protein / 

ribosomal protein S30 0.123 

309067 AI916313 Hs.212788 EST 0.123 

5 317464 AA9B8472 Hs.130463 ESTs > 0.123 

328755 CH.07_hsgi|5868301 0.123 

326036 CH.17_hsgi|5867178 0.123 

327208 CH.01_hsgi|5867447 0.123 

326124 CH.17_hsgi|5916395 0.123 

10 327509 CH.02_hsgi|6117815 0.123 

338398 CH22_EM:AC005500.GENSCAN.336-5 0.123 

304652 AA527782 Hs.84298 CD74 antigen (invariant polypeptide of major 

histocompatibility complex; class II antigen-associated) 0.123 

335797 CH22_FGENES.612_6 0.1.24 

15 336714 CH22_FGENES.76-29 0.124 

327204 CH.01_hsgi|5867447 0.124 

331881 AA430672 Hs.123778 ESTs 0.124 

306971 AI126509 EST singleton (not in UniGene) with exon hit 0.124 

336174 CH22_FGENES.710_1 0.124 

20 336126 CH22_FGENES.701_13 0.124 

329129 CH.X_hsgi|6588026 0.124 

303049 AW407562 EST cluster (not in UniGene) with axon hit 0.124 

335778 CH22_FGENES.607_14 0.124 ~ 

336601 CH22_FGENES.389_2 0.124 

25 334340 CH22_FGENES.375_17 0.124 

337436 CH22_FGENES.767-1 0.124 

306013 AA896990 EST singleton (not in UniGene) with exon hit 0.124 

339213 CH22 FF113D11.GENSCAN.6-8 0.124 

335355 CH22_FGENES.541_2 0.124 

30 336552 CH22_FGENES.841_9 0.124 

336384 CH22_FGENES.822_4 0.124 
310485 AI2B6202 Hs.149800 ESTs 0.125 
335840 CH22_FGENES.622 3 0.125 
336444 CH22_FGENES.827_10 0.125 

35 315703 N36070 EST cluster (not in UniGene) 0.125 

327763 CH.05_hsgi|5867961 0.125 

336333 CH22_FGENES.822_3 0.125 

333496 CH22_FGENES.168_6 0.125 

328662 CH.O7_hsgi|6O04473 0.125 

40 338986 CH22_DA59H18.GENSCAN.5-1 0.125 

328311 CH.07_hsgi|5868371 0.125 

337241 CH22_FGENES.644-2 0.125 

336933 CH22_FGENES.350-7 0.125 

313483 AW294432 Hs.144252 ESTs 0.125 

45 326116 CH.17_hsgi|5867193 0.125 

330450 HG363-HT363 Epidermal Growth Factor Receptor-Related Protein 0.125 

307491 AI268539 EST singleton (not In UniGene) with exon hit 0.125 

331852 AA418988 Hs.98314 Homo sapiens mRNA; cDNA DKFZp586L0120 

(from clone DKFZp586L0120) 0.125 

50 330462 HG944-HT944 Dopamine Receptor D4 0.125 

304410 AA284508 EST singleton (not in UniGene) with exon hit 0.125 

336385 CH22_FGENES.822_5 0.125 
336793 CH22_FGENES.176-3 - 0.125 
326243 CH.17_hsgi|5867261 0.125 

55 327266 CH.01_hs gi|5867462 0.125 

320753 AF070579 Hs.181544 Homo sapiens clone 24487 mRNA sequence 0.125 

336960 CH22_FGENES.369-5 0.125 

329667 CH.14_p2gi|6272129 0.125 

328168 CH.06_hsgi|5868071 0.125 

60 336534 CH22_FGENES.839_16 0.125 

339289 CH22_BA354l12.GENSCAN.16-9 0.126 

309230 AI970747 EST singleton (not in UniGene) with exon hit 0.126 

339190 CH22_FF113D11.GENSCAN.1-2 0.126 

3370B6 CH22_FGENES.458-14 0.126 

65 319233 R21054 Hs.211522 ESTs 0.126 

339396 CH22_BA232E17.GENSCAN.6-8 0.126 

331930 AA449077 Hs.179765 Homo sapiens mRNA; cDNA DKFZp586H1921 

(from clone DKFZp586H192 0.126 

308099 AI475914 EST singleton (not in UniGene) with exon hit 0.126 
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338477 CH22_EM:AC005500.GENSCAN.373-5 0.126 

334286 CH22_FGENES.369_16 0.126 

317245 AI025039 Hs.131732 ESTs 0.126 

335249 CH22 FGENES.516J0 0.126 

5 333327 CH22_FGENES.138_20 0.126 

304240 AA0O9802 EST singleton (not in UniGene) with exon hit 0.126 

335464 CH22 FGENES.562_26 0.126 

335236 CH22_FGENES.515_8 0.126 

334154 CH22_FGENES.340_4 0.126 

10 309257 AI984183 EST singleton (not in UniGene) with exon hit 0.126 

310015 AI220122 Hs.201981 ESTs; Weakly similar to breast carcinoma-associated antigen 

[H.sapiens] 0.126 

328280 CH.07_hsgi|5868352 0.126 

305744 AA831819 EST singleton (not in UniGene) with exon hit 0.126 

15 327430 CH.02_hsgi|5867754 0.126 

328323 CH.07_hsgi|5868373 0.126 

333274 CH22_FGENES.123_2 0.126 

337193 CH22.FGENES.575-3 0.127 

334820 CH22_FGENES.437_2 0.127 

20 328706 CH.07_hsgi|5868270 0.127 

331228 W67267 Hs.174911 ESTs 0.127 

307205 AI192479 EST singleton (not in UniGene) with exon hit 0.127 

337123 CH22 FGENES.519-3 0.127 ~ 

326201 CH.17_hsgi|5867216 0.127 

25 335276 CH22_FGENES.523_2 0.127 

331202 TB1115 Hs.191136 ESTs 0.127 

330532 U03187 Hs.121544 interleukin 12 receptor; beta 1 0.127 

321235 N49521 EST cluster (not in UniGene) 0.127 

301743 F12605 Hs.204529 ESTs; Weakly similar to reverse transcriptase [H.sapiens] 0.127 

30 328175 CH.06_hsgi|5868073 0.127 

306407 AA971985 EST singleton (not in UniGene) with exon hit 0.127 

327145 CH.01_hsgi|5867548 0.127 

327649 CH.04_hs gi|5867899 0.127 

335142 CH22.FGENES.493J2 0.127 

35 333909 CH22_FGENES.295_2 0.127 

330608 X04325 Hs.2679 gap junction protein; beta 1; 32kD (connexin 32; 

Charcot-Marie-Tooth neuropathy; X-linked) 0.127 

330158 CH.21_p2gi|6580367 0.127 

320153 AF064594 Hs.120360 phospholipase A2; group VI 0.127 

40 314407 AA098835 Hs.224432 ESTs 0.127 

333383 CH22_FGENES.143_22 0.127 

320683 AI734242 Hs.244473 ESTs 0.128 

326233 CH.17_hsgi|5867232 0.128 

326598 CH.20_hsgi|5867634 0.128 

45 335174 CH22 FGENES.504_4 0.128 

319843 H29920 Hs.99486 ESTs; Weakly similar to aralarl [Usapiens] 0.128 

335458 CH22_FGENES.562J8 0.128 

332997 CH22_FGENES.58_4 0.128 

334188 CH22_FGENES.352_3 0.128 

50 329759 CH.14_p2gi|6048280 0.128 

330348 CH.09_p2gi|4544475 0.128 

326958 CH21_hsgi|6469836 0.128 

305263 MS79467 EST singleton (not in UniGene) with exon hit - 0.128 

337693 CH22_EM:AC000097.GENSCAN.78-14 0.128 

55 326812 CH.20_hs #682504 0.128 

333237 CH22_FGENES.108_7 0.128 

333699 CH22_FGENES.250_13 0.128 

311496 AI768677 Hs.209888 ESTs; Weakly similar to phosphatidyiserine 

synthase-2 [M.musculus] 0.128 

60 336499 CH22_FGENES.833_4 0.128 

320087 AF032387 Hs.113265 small nuclear RNA activating complex; polypeptide 4; 190kD 0.128 

309989 AI184186 Hs.197813 ESTs 0.128 

301490 AW298468 Hs.250461 ESTs 0.128 

337011 CH22 FGENES.427-6 0.128 

65 315052 AA876910 Hs.134427 ESTs 0.12B 

301611 W22172 Hs.59038 ESTs 0.128 

336497 CH22_FGENES.833_2 0.129 

302068 Y16280 Hs.132049 andothelin type b receptor-like protein 2 0.129 

334502 CH22_FGENES.397_18 0.129 
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304332 AA158884 

304522 AA465405 

312407 R46180 

310098 AI685841 

301119 AF142579 

309268 AI985821 

330989 H42142 



337140 

321758 U29112 
304619 AA515554 
312469 AA745289 



Hs.161354 ESTs 

EST cluster (not in UniGene) with exon hit 
Hs.62954 ferritin; heavy polypeptide 1 
Hs.226396 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 19 

(Dbp5; yeast; homolog) 

CH22 FGENES.361-4 

CH.19_p2gi|6015202 

CH22_FF113D11.GENSCAN.6-7 

CH.21_hs gi|6004446 

EST singleton (not in UniGene) with exon hit 

EST singleton (not in UniGene) with exon hit 

CH22_FGENES.537-5 

EST cluster (not in UniGene) 
Hs.1 19598 ribosomal protein L3 
Hs.1 73088 ESTs 

CH22_DA59H18.GENSCAN.20-6 

CH.19_p2gi[6015202 

CH22 FGENES.138 4 

CH22_EM:AC005500.GENSCAN.121-1 
Hs.1 90028 ESTs 

Hs.128494 ESTs; Weakly similar to EG:95B7.2 [D.melanogaster] 
CH22_EM:AC005500.GENSCAN .398-1 1 



305836 AA858043 
339059 

305610 AA782319 
305852 AA862455 
327409 

312751 AI613089 



311159 AW025919 
322715 AA057230 
336441 



T singleton (not In UniGene) with exon hit , 
CH22_DA59H1 8.GENSCAN .30-5 
EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 
CH.02_hs gi|5867750 
Hs.1 64178 ESTs 



317252 AA905178 
329034 

305186 AA664230 



CH.16_hsgi|5867147 
Hs.1 97636 ESTs 
Hs.182135 ESTs 

CH22_FGENES.827_7 

CH22_FGENES.814_12 

EST singleton (not in UniGene) with exon hit 

CH22_FGENES.217_8 

CH22_EM:AC005500.GENSCAN.384-17 

CH.21_hsgi|5867684 

CH22_FGENES.717-1 

CH.20_hs gi[5867615 

EST singleton (not in UniGene) with exon hit 
EST cluster (not in UniGene) with exon hit 
CH22_EM:AC005500.GENSCAN.359-22 
CH22_FGENES272_5 



CH22_FGENES.541_12 



CH22_EM:AC005500.GENSCAN.13-19 
CH.11_hsgi[5866895 
Hs.1 67837 ESTs 

Hs.1 44879 dual specificity phosphatase 9 
Hs.1 54434 ESTs; Weakly similar to unknown [H.saplens] 
Hs.29797 ribosomal protein L10 
CH22_FGENES.571_5 



Hs.184221 ESTs 

CH22_FGENES.439 21 

CH22 FGENES.139_4 
Hs.130124 ESTs 

CH.X_hsgi|5868561 

EST singleton (not in UniGene) with exon hit 
CH22 FGENES.604_4 
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302143 H15270 


Hs 189847 putative neuronal cell adhesion molecule 


0131 




CH22 FGENES.465 3 




318994 C15110 


Hs.17802 ESTs" 


0 131 




CH22 FGENES.397 14 






CH22 FGENES.146~2 




329676 


CH.14_p2 gi[6272128 


0.132 




CH 01 hs gi|5867473 




305022 M627416 


EST sinQleton (not in UniGene) with exon hit 


0.132 


336B05 


CH22 FGENES.196-3 






CH22 FGENES.428 10 ' 




334761 T93657 




0.132 


339400 


CH22 BA232ElV.GENSCAN.7-6 


0.132 


330301 


CH.O6_p2gi|2905862 


0.132 


316822 AA827691 


Hs.129967 ESTs; Weakly similar to neuronal thread protein 










328020 


CH.06 hs gi|5902482^ 






CH.1 1 hs gi|5866875 




321163 AA209530 


EST cluster (not in UniGene) 


0132 


336393 


CH22 FGENES.823 5 






CH.16~hs gi[5867104 




305237 AA676286 








CH22^DA59H18.GENSCAN_8-6 " 


0.132 




CH.12 hs gi|5866920 






CH22 FGENES 304 7 
CH22 FGENES 562 8 




302286 R58438 




0133 




CH22 FGENES 496 3 






CH 01 hs gi|5902477 




308070 AI470948 




0133 


308311 AI581855 








Hs 208839 ESTs 












CH 07 hs gij5868373* 




320603 R51419 








CH22 FGENES 3 1 




314976 AA524725 


Hs.162108 ESTs 


0133 


303309 AL134164 


Hs.224868 ESTs 




320581 R39753 


Hs.170187 ESTs 






CH22 FGENES.302 2 




317992 AI733512 


Hs 130901 ESTs 






CH22 FGENES.36-5 ^ ^ 




336659 




0133 




CH22~DJ32i10 GENSCAN 6-10 




^ffi?7? 


Hs 181165 eukaiyotic translation elongation factor 1 alpha 1 


0133 




CH22_FGENES.183_2 


OlU 


316952 AW450033 


Hs 163312 ESTs 






CH22 FGENES.283 1 


0 134 




CH.07 hsgi|5868262 


















CH22~FGENES.5135 


0134 




CH22 EM*AC005500 GENSCAN 179-3 






CH22 FGENES 745-1 






CH.01 hs gi[655241 1 






CH.06hs gi|5868038 




323604 AI751438 


Hs 182827 ESTs" Weakly simitar to M" ALU SUBFAMILY SQ 






WARNING ENTRY!!!! 


0.134 


337591 


CH22_C20H12.GENSCAN.6-6 


0.134 


307018 AI140639 


EST singleton (not in UniGene) with exon hit 


0.134 


326896 


CH.21_hsgi|5867680 


0.134 


333479 


CH22_FGENES.163_5 


0.134 


337915 


CH22_EM:AC005500.GENSCAN.61-3 


0.134 


335110 


CH22_FGENES.494J8 


0.134 


333481 


CH22 FGENES.163.9 


0.134 


327512 


CH.02 hsgi|6117815 


0.134 


300096 AW328639 


Hs.83575 ESTs; Weakly similar to ZC328.3 [C.elegans] 


0.134 


330163 


CH.02_p2gi|6042042 


0.135 


335752 


CH22_FGENES.604_1 


0.135 


334857 


CH22_FGENES.443_1 


0.135 
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301872 H84730 EST cluster (not in UniGene) with exon hit 0.135 

337529 CH22_FGENES.823-29 0.135 

335734 CH22_FGENES.601_4 0.135 

337551 CH22_FGENES.847-8 0.135 

5 309078 AI920965 Hs.77961 major histocompatibility complex; class I; B 0.135 

335513 CH22_FGENES.571_28 0.135 

339078 CH22_DA59H18.GENSCAN.37-6 0.135 

321907 N56660 Hs.148722 ESTs; Weakly similarto large tumor suppressor 1 [H.sapiens] 0.135 

337189 CH22_FGENES.571-32 0.135 

10 329635 CH.12_p2gi|5302817 0.135 

308601 AI719930 EST singleton (not in UniGene) with exon hit 0.135 

305020 AA627248 Hs.2064 vimentin 0.135 

333894 CH22_FGENES.293_1 0.135 

322465 AA137152 Hs.3784 ESTs; Highly similar to phosphoserine aminotransferase 

15 [H.sapiens] 0.135 

305601 AA780975 EST singleton (not in UniGene) with exon hit 0.135 

332186 H10781 Hs.141051 ESTs; Moderately similarto!!!! ALU SUBFAMILY SB 

WARNING ENTRY 0.135 

327822 CH.05_hsgi|5867968 0.135 

20 310087 AI393914 Hs.160624 ESTs; Weakly similar to similar to CR16; SH3 domain 

binding protein 0.135 

328752 CH.07_hsgi|5868298 0.135 _ 

337611 CH22_C20H12.GENSCAN.19-4 0.135 

334470 CH22_FGENES.394_1 0.136 

25 335115 CH22_FGENES.496_2 0.136 

328730 CH.07_hsgi|5868289 0.136 

330350 CH.09_p2gi|3056622 0.136 

336971 CH22_FGENES.378-6 0.136 

308258 AI565612 EST singleton (not in UniGene) with exon hit 0.136 

30 326745 CH.20_hsgi|5867611 0.136 

335440 CH22_FGENES.560_3 0.136 

320257 AA330746 EST cluster (not in UniGene) 0.136 

328677 CH.07_hsgi|5868256 0.136 

329731 CH.14_p2gi|6065783 0.136 

35 315950 AA700553 Hs.206974 ESTs 0.136 

330049 CH.17_p2gi|4557182 0.136 

337070 CH22_FGENES.448-3 0.136 

304095 H11324 Hs.31059 EST 0.136 

309304 AW005527 Hs.232820 EST 0.136 

40 333458 CH22_FGENES.157_7 0.136 

329899 CH.15_p2gi|6563505 0.136 

322202 AI275056 Hs.200133 ESTs 0.136 

333991 CH22_FGENES.310_15 0.136 

318617 AW247252 Hs.75514 nucleoside phosphorylase 0.136 

45 310623 AI34158S Hs.195588 ESTs 0.136 

330489 M23323 Hs.3003 CD3E antigen; epsilon polypeptide (TiT3 complex) 0.136 

309646 AW194694 EST singleton (not in UniGene) with exon hit 0.136 

331068 R00071 Hs.191199 ESTs 0.136 

334285 CH22_FGENES.369_15 0.136 

50 332178 F13689 Hs.100725 EST 0.136 

305724 AA827603 EST singleton (not in UniGene) with exon hit 0.136 

303158 AL138110 Hs.8594 Homo sapiens mRNA containing (CAG)4 repeat; clone CZ-CAG-7 0.136 

334543 CH22_FGENES.403_8 - 0.136 

335384 CH22_FGENES.543_26 0.136 

55 336527 CH22_FGENES.839_8 0.136 

334951 CH22_FGENES.465_20 0.136 

325882 CH.16_hsgi|5867087 0.137 

305134 AA653159 EST singleton (not in UniGene) with exon hit 0.137 

307058 AI148709 EST singleton (not in UniGene) with exon hit 0.137 

60 331943 AA453418 Hs.17B272 ESTs 0.137 

331116 R44780 Hs.22634 ESTs 0.137 

306094 AA908877 EST singleton (not in UniGene) with exon hit 0.137 

333561 CH22_FGENES.180_18 0.137 

321439 H61962 EST cluster (not in UniGene) 0.137 

65 324594 AA497090 EST cluster (not in UniGene) 0.137 

337926 CH22_EM:AC005500.GENSCAN.77-4 0.137 

337353 CH22_FGENES.726-1 0.137 

331836 AA412295 Hs.104774 EST 0.137 

308981 AI873242 EST singleton (not in UniGene) with exon hit 0.137 
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329424 CH.Y_hsgi|5868879 0.137 

325829 CH.15_hsgi|5867052 0.137 

331845 AA416863 Hs.98183 ESTs 0.137 

333854 CH22_FGENES.290_13 0.137 

5 306591 AI000248 EST singlaton (not In UniGene) with exon hit 0.137 

328948 CH.OB hsgi|6456765 0.137 

338935 CH22_DJ32I10.GENSCAN.18-12 0.137 

325960 CH.16_hsgi|5867147 0.137 

328377 CH.07_hs gi|5868390 0.138 

10 308851 AI829820 EST singleton (not in UniGene) with exon hit 0.138 

314620 M424352 Hs.210586 ESTs 0.138 

337592 CH22_C20H12.GENSCAN.6-7 0.138 

338684 CH22_EM:AC005500.GENSCAN.472-3 0.138 

331800 AA400498 Hs.97543 ESTs 0.138 

15 304587 AA505535 EST singleton (not in UniGene) with exon hit 0.138 

333981 CH22_FGENES.310_4 0.138 

332452 AA040369 Hs.11170 SYT interacting protein 0.138 

305752 AA835278 EST singleton (not in UniGene) with exon hit 0.138 

311947 T65554 Hs.251591 EST 0.138 

20 333783 CH22_FGENES.273_5 0.138 

337406 CH22_FGENES.754-14 0.138 

327976 CH.06J1S gi|5868212 0.138 

325593 CH.13_hsgi|5866992 0.138 

339425 CH22_DJ579N16.GENSCAN.14-4 0.138 

25 304475 AA428879 EST singleton (not in UniGene) with exon hit 0.138 

309488 AW131104 EST singleton (not in UniGene) with exon hit 0.138 

337532 CH22_FGENES.827-6 0.138 

317234 AA904448 Hs.126368 ESTs 0.138 

312261 AA854425 Hs.144455 ESTs 0.138 

30 328927 CH.08_hs gi|5868500 0.138 

336424 CH22_FGENES.824_9 0.138 

326667 CH.20J1S gi|6552455 0.138 

325988 CH.16_hsgi|5867064 0.138 

31B446 AW300287 EST cluster (not in UniGene) 0.139 

35 338511 CH22_FGENES.834_6 0.139 

335204 CH22_FGENES.508_13 0.139 

303244 AA147472 EST cluster (not in UniGene) with exon hit 0.139 

330870 AA115804 Hs.187593 ESTs 0.139 

329376 CH.X_hsgi|5868859 0.139 

40 304703 AA563898 EST singleton (not in UniGene) with exon hit 0.139 

333653 CH22_FGENES.239_2 0.139 

306799 AI051696 EST singleton (not in UniGene) with exon hit 0.139 

304872 AA595289 EST singleton (not in UniGene) with exon hit 0.139 

330812 AA013001 Hs.60563 ESTs 0.139 

45 329568 CH.10_p2gi[3962490 0.139 

319210 AA253074 Hs.146261 ESTs 0.139 

334320 CH22_FGENES.374_5 0.139 

300860 AI916949 Hs.149748 ESTs; Weakly similar to weak similarity to oollagens [C.elegans] 0.139 

305866 AA864533 EST singleton (not in UniGene) with exon hit 0.139 

50 312943 AA984364 Hs.119064 ESTs 0.139 

330523 M99439 Hs.83958 transducin-like enhancer of split 4; homolog of Drosophila E(sp1) 0.139 

312708 AI076204 Hs.135440 ESTs 0.139 

309366 AW072970 EST singlaton (not in UniGene) with exon hit - 0.139 

303273 AA316069 EST cluster (not in UniGene) with exon hit 0.139 

55 317484 AW274696 Hs.143921 ESTs 0.139 

333239 CH22_FGENES.111_1 0.139 

307126 AI1B4951 EST singleton (not in UniGene) with exon hit 0.139 

316813 AA826505 Hs.124517 ESTs 0.139 

331746 AA281365 Hs.121640 ESTs; Weakly similar to KIAA0386 [H.sapiens] 0.139 

60 308558 AI700145 Hs.172182 poly(A)-binding protein; cytoplasmic 1 0.139 

310784 AW036142 Hs.159017 ESTs 0.139 

323831 AA335715 Hs.200299 ESTs 0.139 

307692 AI318342 EST singleton (not in UniGena) with exon hit 0.139 

310570 AI318327 EST cluster (not in UniGene) 0.139 

65 327934 CH.06_hsgi|5868184 0.139 

305232 AA670052 Hs.195168 glyceraldehyde-3i]hosphate dehydrogenase 0.139 

334756 CH22_FGENES.428_5 0.139 

331938 AA451867 Hs.99255 ESTs 0.139 

301393 AI474722 Hs.150898 ESTs; Weakly similar to KIAA0S44 protein [H.sapiens] 0.139 

246 



WO 02/30268 



PCT/US01/32045 



312005 T78450 


Hs.13941 ESTs 






CH22_EM:AC005500.GENSCAN.351-4 




331214 T90496 


Hs.16757 ESTs 




333601 


CH22_FG ENES.2 1 3_4 




323481 AA278449 


Hs.1 37429 ESTs 


nil 


336911 


CH22_FG ENES.344-4 




338157 


CH22_EM :AC005500.GENSCAN.209-5 


nil 




CH.05_hs gi]6531962 




319109 Z45662 


Hs.90797 Homo sapiens clone 23620 mRNA sequence 




334763 


CH22_FG ENES.428_1 2 




329384 


CH.X_hS gi|5863869 


014 


302996 AF054663 


EST cluster (not in UniGene) with exon hit 




323751 AW452656 


Hs.209324 ESTs 




329916 


CH.16_p2 gi|6223624 


o!l4 


301993 N49826 


Hs.18602 ESTs 


0.14 


338129 


CH22_EM:AC005500.GENSCAN.197-2 






CH.14_hs gi|5867028 






CH22_FG ENES.590_7 




331673 W72366 


Hs.40033 ESTs 




316807 AI018331 


Hs.172444 ESTs; Highly similar to transcription regulator [M.musculus] 




310743 AW449754 


Hs.1 58665 ESTs 


nil 


326941 


CH.21_hs gi|6004446 




328809 


CH.07_hS gi|5868327 




323855 AI653164 


Hs.1 28665 ESTs 


nil 


304705 AA564064 


EST singleton (not in UniGane) with exon hit 




325666 


CH.14_hs gi|6469822 
CH22_FGENES.265_6 




333747 

318287 AW015616 


Hs.143321 ESTs 


nili 


332972 


CH22_FGENES.51_5 




305704 AA825266 


EST singleton (not in UniGane) with exon hit 


nil! 


315699 AW1 82805 


Hs.1 89183 ESTs; Weakly similar to Nodi (H.sapiens] 




327296 


CH.Q1_hs gi|5867492 




336400 


CH22_FGENES.823_15 




321033 H26214 


Hs.20733 ESTs; Weakly similar to !!!! ALU SUBFAMILY SX 






WARNING ENTRY 


nil! 


316522 AI475995 


Hs.1 22910 ESTs 




335715 


CH22_FGENES.599_15 




335959 


CH22_FGENES.650_2 






CH22_FGENES.1 18_7 




337382 


CH22_FGENES.744-8 




322346 AA227618 


Hs.1 0882 HMG-box containing protein 1 




325378 


CH.12_hs gi|5866920 


oil! 




CH22_EM:AC005500.GENSC AN .390-1 




338460 


CH22_EM:AC0055Q0.GENSCAN.362-5 




315279 AW511138 


Hs.256581 ESTs 




314439 AI539443 


Hs.1 37447 ESTs 




333624 


CH22_FGENES.222_3 


nil! 


329237 


CH.X_hsgi|5868729 




330117 


CH.19_p2 gi|601 5201 




338017 


CH22_EM:AC005500.GENSCAN.134-1 


0.141 


337854 


CH22_EM:AC005500.GENSCAN.38-1 2 




329984 


CH.16_p2 gi|4646193 




305004 AA622328 


Hs.1 62762 EST 




302815 N40373 


EST cluster (not in UniGene) with exon hit 




327823 


CH.05 hsgi|5837968 




326753 


CH^0_hsgi|5867616 


0/142 


301201 AA904482 


Hs.197775 ESTs 


0.142 


334303 


CH22 FGENES.373 6 


0.142 


326453 


CH.19_hsgi|5867399 


0.142 


311050 AI864581 


Hs.215477 ESTs 


0.142 


308740 AI802711 


Hs.210337 EST; Weakly similar to aldolase A [H.sapiens) 


0.142 


331003 H63959 


Hs.142722 ESTs 


0.142 


338010 


CH22 EM:AC005500.GENSCAN.128-B 


0.142 


336326 


CH22 FGENES.812 4 


0.142 


318100 R44308 


Hs.242302 ESTs 


0.142 


320641 R55421 


EST cluster (not in UniGene) 


0.142 


325855 


CH.16 hsgi 5867067 


0.142 


330425 HG1728-HT1734 Non-Specific Cross Reacting Antigen (Gb:D90277), 


0.142 
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324583 AA425411 
326263 

331390 AA460341 



331919 AA446869 

312214 AI24E004 

323198 AW179174 

316107 AI204001 

301335 AA885317 



305903 AA873085 



15 337913 

301436 AA961061 



Hs.22581 ESTs 

CH.17Jisgi|5867267 
Hs.45008 ESTs 

CH22_DJ32I1 0.GENSCAN.10-1 6 

CH22 FGENES.79J 
Hs.1 19316 ESTs 
Hs.125187 ESTs 
Hs.7984 ESTs 
Hs.184014 ribosomal protein L31 
Hs.190511 ESTs 

CH22_FGENES.747-3 

CH.12_hsgi[6682452 

EST singleton (not in UniGene) with exon hit 
phosphodiesterase l/nucleotide pyrophosphatase 2 (autotaxin) 
CH22_EM:AC005500.GENSCAN.59-10 



Hs.174185 
Hs.131696 



CH22 FGENES.486_5 
338451 CH22_EM:AC005500.GENSCAN.359-39 
302777 AJ230640 EST cluster (not in UniGene) with exon hit 

20 330464 J03068 Hs.78223 N-acylaminoacyl-peptide hydrolase 
33098B H41411 Hs.33855 ESTs 
328939 CH.08_hsgi|6004481 

308015 AI4401 74 Hs.228907 EST; Weakly similar to GUANINE NUCLEOTIDE-BINDING 
PROTEIN BETA SUBUNIT-LIKE PROTEIN 
25 l2.3[H.sapiens] 

328504 CH.07_hsgi|5868471 

332599 AA402891 Hs.32951 solute carrier family 29 (nucleoside transporters); member 2 
335744 CH22_FGENES.601_15 
322394 AF077208 EST cluster (not in UniGene) 

30 323892 AL042661 EST cluster (not in UniGene) 

318443 A1939323 Hs.157714 ESTs; Weakly similar to NEURONAL ACETYLCHOLINE 

RECEPTOR PROTEIN; ALPHA-5 CHAIN PRECURSOR 

[H.sapiens] 



305993 AA889197 
335719 
45 325682 
327350 



Hs.33183 ESTs; Weakly similarto estrogen-responsive finger protein; 
efp [H.sapiens] 

CH22_FGENES.165_2 



55 321921 AF070619 
319346 T70147 
304265 AA062892 



EST singleton (not in UniGene) with exon hit 

CH22_FGENES.599_22 

CH.14_hsgi]6138923 

CH.01_hsgi|6249563 

CH22_BA354I12.GENSCAN.18-1 

CH.18_hsgi|5B67293 

CH.08_p2gi|6007576 
Hs.174131 ribosomal protein L6 

CH22_EM:AC005500.GENSCAN.1 64-1 

CH22_DA59H18.GENSCAN.18-7 

CH.05_hsgi|5867964 

CH22_FGENES.41-8 

EST cluster (not in UniGene) 
Hs.12024 ESTs 

EST singleton (not in UniGene) with exon hit 
Ks.250178 copinell 

CH.02_hs gi|6017023 

CH22 FGENES.513J3 

CH22_DA59H18.GENSCAN52-1 
Hs.33026 ESTs; Weakly similar to similar to Enterococcus faecalis 
TRAB [Celegans] 
Hs.201811 EST 

Hs.156015 Homo sapiens chromosome 1 9; cosmid R29381 
EST cluster (not in UniGene) with exon hit 
CH22_FGENES.757-2 
EST duster (not in UniGene) 
CH.Osgi|5868806 
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336857 CH22_FGENES.291-7 0.145 

332565 AA234B96 Hs.25272 El A binding protein p300 0.145 

318634 AI928Q98 Hs.156832 ESTs 0.145 

336318 CH22_FGENES.801_1 0.145 

5 310960 AI923551 Hs.170843 ESTs 0.145 

335346 CH22_FGENES.537_2 0.145 

331196 T65416 Hs.12826 ESTs 0.145 

337607 CH22_C20H12.GENSCAN.17-3 0.146 

331206 T84096 Hs.15284 ESTs 0.146 

10 301793 T60698 EST cluster (not in UniGene) with exon hit 0.146 

319590 AA210878 EST cluster (not in UniGene) 0.146 

311394 AI695374 Hs.256231 ESTs 0.146 

324773 M632554 Hs.163401 ESTs 0.146 

324841 AI142359 Hs.155316 ESTs 0.146 

15 332260 N70088 Hs.1 38467 ESTs 0.146 

329276 CH.X_hsgl|5868762 0.146 

335837 CH22_FGENES.633_1 0.146 

338294 CH22_EM:AC005500.GENSCAN.297-1 0.146 

336993 CH22_FGENES.409-4 0.146 

20 334135 CH22_FGENES.336_2 0.146 

326251 CH.17_hsgi|5867263 0.146 

337396 CH22.FGENES.749-1 0.146 . 

339167 CH22_DA59H18.GENSCAN.69-8 0.146 

316838 AW135418 Hs.161210 ESTs 0.146 

25 325313 CH.11_hsgi|5866865 0.146 

331047 N66918 Hs.32205 ESTs 0.146 

323915 AL043362 EST cluster (not in UniGene) 0.146 

302747 AF062275 EST cluster (not in UniGene) with exon hit 0.146 

306317 M947909 EST singleton (not in UniGene) with exon hit 0.146 

30 334399 CH22_FGENES.382_5 0.146 

326472 CH.19 hs gil5867404 0.146 

333061 CH22_FGENES.75_4 0.146 

337072 CH22_FGENES.448-5 0.146 

334328 CH22_FGENES.375_5 0.146 

35 327039 CH51_hsgi|6531965 0.146 

325576 CH.12_hsgi|6552443 0.147 

315935 AI0758O4 Hs.1 32660 ESTs 0.147 

319638 AA323758 EST cluster (not in UniGene) 0.147 

334501 CH22_FGENES.397_17 0.147 

40 338238 CH22_EM:AC0055(X>.GENSCAN.264-4 0.147 

308636 AI744063 EST singleton (not in UniGene) with exon hit 0.147 

336567 CH22_FGENES.843_6 0.147 

335819 CH22_FGENES.619_2 0.147 

336950 CH22_FGENES.361-8 0.147 

45 307055 AI148477 EST singleton (not in UniGene) with exon hit 0.147 

315134 AW504854 Hs.126714 ESTs 0.147 

335834 CH22_FGENES.621_1 0.147 

327870 CH.06_hsgi|5868131 0.147 

323802 AA332011 Hs.250138 protein phosphatase 2C; magnesium-dependent; catalytic sub unit 0.147 

50 329412 CH.X_hsgi|6682553 0.147 

323791 M333D68 EST cluster (not in UniGene) 0.147 

324126 AA385315 EST cluster (not in UniGene) 0.147 

327865 CH.06J1S gi|5868130 " 0.147 

333445 CH22_FGENES.154_2 0.147 

55 321302 M021351 Hs.158497 KIAA0724 gene product 0.147 

336744 CH22_FGENES.118-9 0.147 

323731 AA323414 EST cluster (not in UniGene) 0.148 

320289 H07989 EST cluster (not in UniGene) 0.148 

305488 M749000 EST singleton (not in UniGene) with exon hit 0.148 

60 305592 M780594 Hs.62954 ferritin; heavy polypeptide 1 0.148 

304094 H11295 EST singleton (not in UniGene) with exon hit 0.148 

325040 AW296368 EST cluster (not in UniGene) 0.148 

339034 CH22_DA59H18.GENSCAN.26-2 0.148 

334504 CH22_FGENES.398_2 0.148 

65 334778 CH22_FGENES.431_2 0.148 

320148 U77494 Hs.1 19687 RAN binding protein 8 0.148 

303584 AW1 73759 Hs.203401 ESTs 0.148 

325826 CH.15_hsgi[5867048 0.148 

331192 T55182 Hs.152571 ESTs; Highly similar to IGF-II mRNA-binding protein 2 [H.sapiens] 0.148 
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333166 
336548 




5 


337552 

338936 
331869 


AA382742 
AA428554 


10 


332865 

328436 
311158 
336942 
302262 


AI634864 
R53169 


15 


333296 
333365 
311706 
337109 


AW452392 
AW1 73300 


20 


333454 
334784 
333255 
337518 


AA489268 


25 


320651 
323437 
328761 
328787 
335261 


AA287567 


30 


300827 
339263 
337412 
334414 


R16689 


35 


310801 AW270980 
305216 AA669056 
314779 AA470122 




303342 


AW247361 


40 


337509 
306631 
302533 
336536 


AI001149 
L36149 




324666 
310173 


T32458 
AI767433 




333595 
335975 
306654 
335025 
328711 


AI003654 


50 


328274 
325505 






329541 
304955 
339103 


AA613504 


55 


329636 
310118 


AI203293 




303773 AA769074 
303153 U09759 



CH.14_hsgl|6381957 

CH22_FGENES.91 8 

CH22_FGENES.841_5 

CH22_C4G1.GENSCAN.1-4 
Hs.97151 EST 

CH22_DJ32l10.GENSCAN.19-6 
Hs.104894 ESTs; Weakly similar to fibronedin precursor [H.sapiens] 

CH22_FGENES.28_5 

CH.07_hsgi|6004473 

CH.07_hsgi|5868417 
Hs.250789 ESTs; Highly similar to similar to NEDD-4 [H.sapiens] 

CH22_FGENES.354-2 
Hs.246091 ESTs 

CH22_FGENES.132_3 

CH22_FGENES.142_2 
Hs.252854 ESTs 

CH22_FGENES.489-2 
Hs.1 90201 ESTs 

CH22_FGENES.157_3 



CH22_FGENES.118_3 
CH22_FGENES.814-7 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
CH.07_hsgi|5868302 
CH.07_hsgi|5868309 



CH22_FGENES.38_5 
Hs.1 06346 novel centrosomal protein RanBPM 

EST singleton (not in UniGene) with exon hit 
Hs.1 90261 ESTs 

CH22_EM:AC005500.GENSCAN.341-27 

EST cluster (not in UniGene) with exon hit 

CH22_FGENES.806-4 

EST singleton (not in UniGene) with exon hit 
Hs.248116 chemokine(C motif) XC receptor 1 



Hs.1 4285 ESTs 
Hs.1 70013 ESTs 

CH22_FGENES.211_2 

CH22_FGENES.652_9 

EST singleton (not in UniGene) with exon hit 

CH22_FGENES.475_3 

CH.07_hsgi|5B68271 

CH.07_hsgi|5868219 

CH.12_hsgi|6682451 

CH.14_p2gi|6468233 

EST singleton (not in UniGene) with exon hit 

CH22_DA59H18.GENSCAN.44-10 

CH.12_p2gi|5302817 
Hs.157489 ESTs 

CH.17_hsgi[5867184 

EST cluster (not in UniGene) with exon hit 
Hs.8325 mitogen-actwated protein kinase 9 



0.148 
0.148 
0.148 
0.148 
0.148 
0.148 
0.148 
0.148 



0.148 

0.148 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 ~ 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 

0.149 



0.149 
0.149 
0.149 
0.149 
0.149 
0.149 
0.149 
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TABLE 13A shows the accession numbers for those primekeys lacking unigenelD's for 
Table 13. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
5 Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey CAT number Accession 



322050 24275J 
321439 1599424J 
20 321656 13653_22 



AL137589 AA423949 BE222949 BE222694 AI199615 AW8731 16 AI277950 AW044290 AW630096 
H61962 W01567 N75711 

BE259906 AA232518 M013359 AL035788 AW160822 BE337134 BE002954 BE391839 AW161565 AI878841 BE616458 
BE409981 BE387308 BE297436 BE315536 AA206924 R12012 AA214169 BE312812 BE387093 H11710 BE312009 
BE260569 AA343566 AA219526 R34757 AA219749 BE336733 AA219751 AW411099 AA232408 BE018716 BE398089 
AA206253 AA053487 AA1 14224 AV6558B8 AW732566 BE394087 AW732574 AA313442 BE333875 AA070548 BE259840 
BE019828 AW732341 AA299916 BE019253 BE018238 BE387109 AA232304 BE255589 AW732585 AA181436 AA308777 
AA07S802 AW732521 AA314526 AA226747 BE409513 AA206168 BE388292 BE298782 BE387086 AA305310 AV652723 
AA314918 BE615510 AW951763 BE398104 BE385195 BE407165 BE391336 BE390187 BE389189 BE540650 BE249884 
BE385985 BE274245 BE391124 BE260080 AA182600 BE512821 BE390O90 BE279398 BE279589 BE263454 BE515194 
BE293569 BE272531 BE388814 BE334659 BE271685 BE561043 BE27S44G BE302572 AW239076 AI750583 M376179 
AA112632 BE266324 BE256614 R13105 AA132286 BE29B305 AI220355 AA20S606 AA219527 AA219519 AW804310 
AA083286 BE171208 T1S693 AA338328 BE185868 M903024 T92162 AA33Q119 BE410404 BE314668 
AW576245 BE207878 AW299993 AI199558 AI285442 AW299994 AW394242 AW394184 
AI357412 AI870708 AI590539 W07459 

AW068287 AA31 0079 BE336702 AA35631 8 AA306059 AA346785 AW402633 AA31 121 0 AW402909 N76879 AW40291 3 
AW401920 AA321636 AA354474 C17297 C16938 AA31 1774 M29871 NM_002872 Z82188 AW405674 H94176 R89281 
AA214723 AI014482 AW949347 T27749 AW804226 AW79S964 AW404581 AF077208 NM_014029 W68830 W79652 
AA353375 AW575218 AA552192 AA521232 AA702695 AA033975 AW407827 AA829948 N94402 AW628604 AI523308 
N57605 AA641662 H42477 N52784 AI753478 AA768493 AA845729 W47391 N55270 AI0901 17 R89282 BE206172 
AA076650AA595650AI218931 BE049397AI433110 W74114H94277AI358627 AI085221 AI862819 AA835967 AW103905 
AI640644 M835S07 AA856887 AA694392 AW337542 AI52441 0 BE045500 AI440060 AI358801 AW028238 AW205248 
AI71 8264 R48S18 AA357358 AI695002 AA897549 AW08106S AI433360 AI810783 AI620963 Z82188 AA360224 
U291 12 AI6S6540 AI364875 AI656246 AI990940 

AA1 69345 AI762857 AI949997 AI809601 AI681948 AI221079 AW167404 AI347614 AI61 1090 AI023472 AI347683 AI027467 
AW591788 AI380665 AA835735 AA836654 AI244028 AW193159 AI500112 AI913722 AI738693 AI702308 M805365 
AI766842 

T59538 T59589 T59598 T59542 AF147374 



AA621975 AI587036 AA665743 AW2040Q3 AI692234 AI002242.AI692219 AW137282 AW268783 AW295910 AI308015 
AW301462 AI318288AI318575 AI318117AI345591 AI249650 AI246934 AI246864 AI246971 AW268311 AI249654BE041907 
AW732776 

N72324 N52825 W19526 BE1 43464 AA376060 

M83667 NM_005195 S63168 M83667 AW068039 AW630649 AI338577 AI018125 AI269878 AW242440 AI887823 AI342581 
BE222416 AI582847 AI65101 1 A1B60815 AI699574 BE550201 AI926996 AW665855 AI827752 AI761857 BE328168 
BE222451 AI762201 AW000929 AW007207 BE042962 BE551843 BE465373 AI279179 AI949945 BE551862 AW051667 
BE328076 BE222296 AW007229 AW772332 AI279801 AI934526 A1631938 AI770103 BE041412 AI41790O AI692655 
AI869943 AW270119 AI431739 A1703347AW770568 AW025473 AI701497 AI128026 BE328147AW203980 BE046793 
AW087704 AI674597 AI650732 AI813691 AI472092 AI695224 AI241217 AW207746 AI206840 AI271362 AI631788 AI911883 
AI914619 AI380585 AI767501 AI823759 AI564116 AI190991 A1377369 AI614122 AI221623 AI354793 AI081988 AI391740 
AI337435 BE467366 AI824347 AI565325 AI280038 AI640455 AI819744 BE467803 BE327524 AI149402 AI313187 BE219684 
AW611948AW665821 AI091260 AW044492 BE220366 AW025381 AW183264 AI694865 AI498474 AI129780 AI202028 
AI566792 BE220659 AI928040 AI830696 AI493021 AW612488 AI913152 BE042965 AI631837 AI693873 AI498925 AI768668 
AI401544 BE327023 AI693383 AI769874 AI744003 AW082273 AI686501 AI798177 AI9851 96 AI090033 AI432342 AIB8991 8 
AI638308 BE468080 BE219588 AI9121 19 BE21 9787 AW005392 BE326564 AI589039 AI860187 AI75B143 AI338168 
AI702936 BE221985 AI498727 AI918196 AI279735 AW771497 AI850133 AW237834 AW661759 AW0281 1 1 BE503416 
AI360180 AW61 1715 AI871777 BE045447 BE326444 AI266547 AI800237 AI823315 AI478368 AI264281 AI675841 AI690041 
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324019 262792J 
323437 189513.1 
307845 19804J0 
324126 272259.1 
309101 7570.1 



301373 368214J 

323665 54093J 

323676 220254J 

302086 23306.1 

323731 226193.1 

323791 232336.1 

325040 23854.1 

324430 312113.1 

323892 477253.1 

309488 1030131.1 

302251 27216.4 

302286 22717.6 

323915 110063.1 

324594 330528.1 

301737 65.1 



A1498018 AI554124 AI239893 AI864054 AI280099 AI192815 AI620465 AI080201 AW002057 BE500986 AI341131 AI818991 
AI566137 AI123403 BE219192 AW183844 AI499842 AW137971 AW138720 AW015526 AW138160 AW2431 63 AW138705 
AW139927AW140O06AW138810AW137450AW206970AW135419AW205974 M043494BE465106AW139955AI741112 
BE326942 M043506 AI079957 AI942432 AI392902 AI097047 AI470599 AA514553 AA984008 N47949 AIS541 14 AA884832 
AI796752 AI765290 AI301 155 AW47035B BE222764 AI823569 AI651 188 AI692695 AI476643 BE504307 AI767573 BE219719 
AI932249 AW467075 AI913633 BE221966 AI091025 AA969215 AI799810 AA931 170 BE048559 AI809606 AI138614 
AI739456 AI674605 AW772068 AI089236 AI625787 AI263418 AW008638 AI928389 AW628997 AI470010 A1914168 AI760003 
AI203050 AI334069 AI694788 BE045337 AI948659 AI912982 AI867131 AI1 92102 AI767583 AI347518 AI566005 AI625884 
AI215888 AI633904 AW182265 AW614357 AI128030 AI343685 AI914283 A1985003 A1823578 AI493053 AI380285 AI633895 ■ 
AI267880 AI538162 AI991552 BE219479 BE219296 AI302178 AW779296 AI913805 AI631 644 AI566772 AI985498 AI942289 
AI935659 AI339092 AI247432 AI686472 AI766886 AI017228 AI333272 AW301668 AI972218 AW082027 AI632974 AI474761 
AI766127 AW236578 AW000966 AI870734 AI222399 AI871249 AI703448 BE464210 AI768037 AI871585 AI767871 AI738757 
AI220732 AI681633 AI768783 AI684463 AI307339 AI263203 AW665264 BE463969 AI768786 A1439118 AI127913 BE218324 
AI672342 BE22Q052 AI796163 AI221662 AW197672 AW025300 AI769681 AW612448 BE219757 AW072420 AI669980 
AI830418 AW204353 AA047011 AA913868 AI739146 AI689954 AW470507 AW614835 AW302151 AW772372 A1762427 
AW339902 AW303370 BE464775 AW299818 AW236072 AW1 95060 AW274737 AW263062 AW183846 AI868894 AW300493 
AW172509 AW516876 AW593773 AW299474 AW303546 A1817323 AI823624 AI694005 AI934589 AI343479 AI861825 
AI962726 AI765845 AW080318 AI640227 AI763042 AI768903 AW235386 M738489 AW341293 AA588585 BE221732 
AI914179 AW61 1669 AI572789 AW194735AW236122AW236007AW612789AW197501 AW1 95046 AI797145 AI864423 
A1458934 AI342848 AI693227 AI912642 AI689993 AA932572 AA740269 AW470392 AW086020 AI221701 T69326 T70461 
AI765579 AI338263 AI431721 AI394249 AI186462 AI823571 AI953665 AI497954 AI761057 AI678228 AI640302 AI948742 
AA594626 AA8831 55 AI972682 AI804774 AI300407 AI433524 AA897341 AI401 1 75 AI291071 AA02121 3 AI126509 AI948955 
AI21 8835 AA903938 AA502610 AI498320 AA584267 AA935285 AI476253 AA489658 AA975053 AA715326 AA557139 
M126417 AA971455 AA557319 AI499738 M911438 AI913637 AA494506 N90793 AI990724 AA131667 AA128164 
M046840 AI262557 M131729 AA594926 T59467 AA436907 AA044630 A1589177 AI279237 AI880498 AI431822 AA708934 
AW612558 AI634069 W0361 0 AI192272 BE550862 AI400879 AA708507 AI128003 AI375308 AI271423 AI199552 AA125977 
AI366498 AA458662 AI694382 AA044527 AI636263 AI796270 T90146 AW014724 AI870812 AI948781 AA369965 AI094721 
AW271817AI262898AI244680 T69252AI934148AA046357W19109AA028157AW021924AA253491 AI189397 AI934388 
D58282 W21323 W24288 AI682972 AA293683 AA284566 AV659511 AA434184 H87089 AA040038 N57464 AA343709 
AW805815 R89837 

BE621320 BE266806 BE276582 AW516729 AF142579 AW451687 AK000069 AA325236 BE168997 W73105 M715365 
BE278873 AA808894 AA386371 AW517942 AW750993 BE140314 BE392384 BE621757 AA318192 BE548173 AW152607 
AW166898 AA352215 AW841506 T59802 AF147378 AA335719 AW956069 T59668 AA826362 AI961329 AI290469 
AW197375 A1805651 AA160748 AA581089 AI9888B9 AA581 100 AA501478 AI621069 AA468534 AA503715 AA658457 
AI144504 BE387827 M159380 
AW177009AI381610 

AA287567 AA2524D4 AW967735 AA287568 AA761222 AA865644 AA831245 
BE514807 R43224 AI363450 AA450226 AF030942 



AI340462 AI583268 AA079086 AI950777 AI301866 AI925108 AW876954 AW877000 AA525418 AA888549 AI934220 
AW380220 AA804858 AI927576 T61 151 AW384053 BE391691 AA533856 AA248400 T48202 N57156 R68346 R26020 
AL050332 W30806 H61369 AA092592 AA230324 BE271217 AW372903 T48772 AA358002 AA094302 AA559856 AW373308 
AW373315 AW373297 AW37331 1 AW373314 AW373309 AW877055 AW770140 AW379805 A1581S09 AW364144 AA078921 
AA715432 AA654210 AI004899 AA602209 W47464 AA506588 R26822 AU076528 AI535743 A1535704 AI535681 
AA402307 D60405 D61237 D59891 AW964877 AA325215 AI459739 N36070 N25658 AA083684 AW293368 AI761958 
AI741205 AI693175 AW873603 AI143269 AI187124 N25199 H19323 AI650842 AW316825 AA083842 AA935650 AW298404 
AI472001 AI648568 R17676 R41625 AI123237 R17677 AI206866 F36920 AI654713 F34084 M618029 AI915139 AW275194 
AW514577 D80420 AW149850 Z40953 AI867861 AA927547 AA974344 AI825793 AI635565 AI652157 BE504748 AW295759 
F16800 AWB39796 F01781 AA909730 AA984010 



AW248307AA313452AW951927AA355961 BE566080 

AI702835 AI758919 AI685405 AI952108 AI299207 AI400767 AW105389 AI952710 AA845312 AI784118 AI537315 
X57138 NM_003514 Z98744 BE253911 BE256314 A1095013 A1138475 
AA323414 AW664013 AI809377 AI276041 AW296883 AI798340 
AA333068 M331863 AA331838 AW962531 AA331442 

AW296368AA247632AK002030R15304 T08775AW975664AI186801 M730688 AW190918 AI141176 AW513211 
AI275071 AA988601 BE042933BE045713AW087176 
AA464018 AA464079 AA468142 
AA846318W15478AL042661 



R58438AA358612 

AL043362 M350031 AW751972 BE5491 18 
AA497090 AI351879 AI350914 

AI815981 AF287269 BE260960 BE263991 AA31 1733 F1 21 45 F07345 Z43604 T29948 H64102 Z4361 1 T35364 N40667 
AI909783 AW751045 AA160594 AI816064 AI307240 AI951554 AA641031 AA293045 AI942492 AI687077 R78689 H12368 
AA894728 AI124930 AI423498 AA777759 AA614585 AW071822 T66288 AI418558 H21480 AI335011 AI051726 AA293436 
AW302233 AW1 88628 N26393 AI076557 AI311022 AW451505 H62593 Z39666 H12315 AI761351 AI364142 F02935 
AW571491 T35366AI240745 H64151 AA503793 AA831948 AI627686 AI761531 F03591 F09782 
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301793 239325J 

10 303049 102592J 

301 BS3 19477J 

301872 27494.4 

301893 6561J 

15 



303181 74060_2 
302569 17513.2 
20 324893 4670J 



303252 149690_3 
303273 67758J 
40 302640 21194J 



R05735 BE349600 R37388 N79751 R10115 AA702039 AA836147AA505716 AI049661 AI499239 R54072 AI023394 
AA827710 W60285 W500O38 AI884786 AA827191 AA810075 AW005088 R70248 AI858560 AW078678 AA631306 H52839 
AW085835 AI656182 AA737178 AW136923 AA281028 AA570316 AA722871 AA362737 AI217268 BE242373 R01 113 
AA628946 AI394527 AW402308 AI361 110 AI917585 T99639 AA805326 N44577 AI394021 AW403385 T23949 AI497766 
T96602 AA834947 AI693908 Z33450 T92127 BE541896 AI933301 BE251540 BE252269 N50968 AI695531 AW575523 
AW296889 N93796 N89924 AI361804 AI085251 AA810694 BE303011 AA743784 R13478 AA358771 AA325294 AW964880 
BE258953 R541 16 AW881039 AW602593 
BE265837 AA340632 T96304 T96075 T72780 H51 978 R09868 

AW408042 AW407562 BE172835 BE396893 BE269184 AA045741 BE004187 AW751261 W74283 

BE263301 AI418863 NM_005194 X52560 AW328683 BE298869 D63161 

H84730T73262 

T80334 BE292758 AK000854 H16996 BE253691 R88508 AA357663 AW955288 AW579550 N98864 AA595201 AI742967 
AA602658 AI091433 AA813367 A1983217 AW298007 AI628490 AI708037 AI560654 AI032983 R88509 R38972 AI687783 
AI560153 AW874581 N69891 AA993617 H51180 AI269042 AI281358 AW591213 AI017724 AI262B59 H16997 R38991 
AI804355 AI868988 AI669525 AW023081 AL047848 
A1734009 AI263076 AW272255 AI792912 

AA452365 M351338 BE262590 BE262591 M074050 AA389667 BE161346 
AC004472 BE312721 BE273942 F11928 T65358 BE612432 BE261576 BE179884 

AA324119 AW246199 BE395368 BE261676 BE382334 BE394701 BE304548 T31940 BE396128 BE398019 BE296693 
BE379564 BE269460 BE397065 Z42029 BE305028 AK000549 BE536182 BE314372 AW393349 T50987 AA069735 
BE386997 AW381699 T51050 W95025 AA477678 AA348306 AW956831 AW06291S AL040397 BE305160 AA315419 
AW249929 AA295944 AI635946 AI870259 AI951 125 AW028250 AI8851 84 AW8731 13 AI077544 AW025091 AI817594 
AI401718 AW008245 AI499064 AA599687 AI016890 AA765638 W93340 AA588708 AW51 9173 R51917 AA676778 AI084871 
AA687684 AI860840 AI81 1921 AW514730 AA477551 N78845 AA779894 AA778559 AI968953 T1 6188 T32828 AA991426 
AI474472 AI473542 AI828972 AW247906 AA977415 AW591489 AA876008 AW191893 AA074278 AW874099 Z40196 
AW083515 F01544T55984 AI290413 AI972167 AI365049 T36028 AI042568 BE560076 W171 19 AA196376 T47999 R54309 
AK001269 AL354613 AA147472 AA490803 BE2Q7628 AW8161 13 AA035574 AW503392 AA299910 AW750305 BE079539 
BE079484 BE512838 AK001593 AW968772 AW967440 AW206280 AA251270 AI627886 AA303599 AA147473 BE206616 
AA490611 AA715039 AW590866 AW590447 AI864512 AA204731 AA894480 BE001136 AA612785 AA237035 AA149960 
Z44257 R12986 AA448446 AI734041 AA422167 BE220551 R66041 R32927 R32942 AA258773 AW386142 R53730 N54624 
AW880296 AA253485 AW954441 H98989 AW614348 AI654838 AA779793 AW237213 N66635 AI186812 AA947479 
BE15801 1 AI859480 AW805579 N52010 M806305 AI628445 AW270990 AA778165 AA149949 AI650728 AA749108 
AA687257 AI261661 AA747442 AA481351 AA206339 M903407 AW473306 AI688930 AA262261 AA448310 AA748820 
AI347430 BE465692 R32839 AW510564 M436408 AA257971 AA253362 AA938330 AA513150 AA976840 M6871 17 
AI281547 AA046243 R32825 AI631554 AW139818 AI244536 R52946 AW235443 R40183 AA299909 AA811958 AI302918 
Z40213 BE158047 BE158060 AA767245 AW748159 AW500735 AA094074 
AW393348 AW393350 AW386713 AW384705 

AA316069 BE274224 AL120803 BE170052 BE170039 AI906340 BE091310 AA491506 AW836675 AW8631 11 
AW973784 AW843642 AA557573 AA578088 AI125161 AA349349 AI372794 BE312586 BE312777 T32148 AW239077 
AI905357 Z42685 AW298772 R18578 AA780425 AA325971 AI372793 R10658 AA295021 AW885349 AW885288 BE271987 



65 302879 36555J 
318540 1018709.2 
302928 22118J 



AW802152 W79803 H12809 AA028951 AW367382 AA295247 H46355 AA905620 R54564 H12765 AW950608 AA028952 
AA366908 AI085652 R43207 R77954 AI672848 T28547 M427734 AA572853 AA769934 AI2421 08 R00020 R02059 R1 0659 
AI185270 AI041890 NMJMOOSO X66403 F03854 AI652442 AI766431 AA976913 AI989882 AM71024 AI802727 AI824112 
F02169 AI890843 BE250876 BE252859 AL157418 R78326 
AW068570 AW247361 AA252638 AI751982 BE260758 BE293073 AW293303 

AB040951 AK002094 AA676593 W44644 N42376 Z45942 AW841844 BE541378 AA358274 AA213391 T88771 NM.015493 

AL117489 N88248 N31714 N36273 N31721 AW576263 AA449380 AI366135 AA551576 AW149789 Z17418 AW474331 

AA056181 AI753611 AA046428 AA488007 AA30O764 N44732 AA377697AA346752AA485787AA894546AA1 15295 

AA299914 R88096 M367342 AW884666 W84522 M426325 AI983849 AA873315 AA873307 AI355170 AA534678 

AA969227 AI127202 AW083323 AI338244 AW020877 AA780019 N33426 AW069314 N63079 AI926527 M115270 

AW886601 AI357402 AA599312 AI460358 AI926969 AA429402 N33197 AW886733 R88205 N52803 AW021983 AA213392 

BE139656 Al 1 42383 AA427844 AA954743 AA233622 AW073382 AA426326 AA493560 AA425133 N24819 AI419516 < 

AI571515 AI1 47373 AI628677 AI214877 AA992123 H71599 M029095 AA622262 AW1 1 7398 AW275286 AI91 1337 

AA864950 T94173 AI475634 AI701411 AI287696 T94091 AA505746 AI184310 AI350967 AI083596 W74274 AI954381 

AI832767 AI368443 AA195578 AW874416 AI005421 AW014339 M908S60 AI350791 AW241382 AI473104 AI275186 

AA515528 AA194897 AA782901 AW0B9414 F20248 AM2601 1 AI305169 AI832109 AI570082 AW072984 AI492474 

AA919076 AL049024 W79889 N42400 AA625435 AW953887 AA233420 

AW779971 AW300287 AW152002 AW069505 AI866447 AI298231 AI146920 AI692267 AI872876 

BE397032 AJ292529 N40373 N34073 AA321112 AW959902 AA258103 AW860213 BE549059 BE296027 BE296657 

AA300789 AI971491 AW513665 M909530 AI951045 AW058103 AI971506 AI061239 AA600054 AI000807 AA969975 

AA281492 AW593654 AA321 1 1 1 AW298633 AI278754 AI863862 AI285506 AA989727 F331 14 T16079 AI762625 AI492103 

AW770346 AW026768 AI468710 AI499987 AA310412 AA622784 M642297 AI866427 

H11802T66097AF042831 

R42185 AW939055 T30280 Z43356 R54166 

AA93B905 AA574056 AA714466 AI805592 AI1 23431 AA229723 AA620759 AI004450 AW299820 AI949299 AW874308 
AA626037 A974112 AA931563 AF073924 AA995769 AI766441 AI367730 AI081342 AA235800 AA235801 AI138970 
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M719797 AA759343 X89673 AA759344 AA312909 X87825 Y1 0529 AC006271 AA758739 BE501015 AA909905 AF065857 
AC006271 AA970044 

AA158883 AA171835 M187049 AA143546 BE299538 BE614280 BE62170S BE299684 BE619550 BE613099 BE619558 
BE514331 BE617716 BE612920 BE615742 BE258739 BE621539 AI434511 BE546696 BE614324 BE379359 BE250106 
BE250681 BE299592 BE300272 BE6168Q5 BE397385 BE562024 BE271246 BE250556 BE280311 BE561995 BE618755 
BE276126 BE546275 BE311547 BE262155 BE281082 BE513087 BE546891 BE514289 BE397389 BE267442 BE545455 
' BE614483 BE293447 BE270710 BE281071 BE267458 BE542Q95 BE262701 BE513634 BE5481 16 BE299546 BE619604 



BE538868 BE250081 BE277706 BE410127 BE619445 BE250753 BE304969 BE616348 BE546878 BE5449S2 BE410346 
BE267256 X17206 NM_002952 BE304541 BE619171 BE259655 BE549186 BE314944 BE613101 BE378069 BE621 110 



BE535432 BE279917 BE312626 BE5311 18 BE378744 BE27S370 BE250195 BE409980 BE274432 BE266637 BE279321 



BEB19517 BE388097 BE284025 BE618945 BE614758 BE312249 BE294359 BE531121 BE622300 BE615109 BE544354 
BE61499B BE393239 BE297520 BE393221 BE278818 BE279309 BE265476 BE61S772 BE615185 BE265144 BE249837 
BE312230 BE407843 BE253884 BE407645 BE615804 BE619058 BE559512 BE383249 BE61 3497 BE294351 BE295062 



BE253369 BE256997 BE269482 BE264959 BE279072 M662160 BE280733 AA85842B BE561308 BE267285 BE561422 
BE563181 BE304614 BE295437 BE619424 BE275863 BE394315 BE408109 BE541866 BE253772 BE618236 BE535261 
BE296490 BE278212 BE563154 BE257245 BE262274 BE513032 BE378567 BE394152 BE618947 BE269302 BE546516 
BE536792 BE615187 BE261 186 BE615367 BE6192B9 BE261 184 T49376 AL031671 BE273400 BE563457 BE545597 
BE615169 AA150323 M158723 M079033 BE313333 M160100 BE271115 BE294302 BE273051 BE273048 BE622390 
AA837947BE387721 AW973277 AA803731 BE280792 AA160444BE256723AI74S920AA643017BE549441 BE293858 
AW975249 AI620819 AW089494 AI434549 BE305231 AA081262 BE280101 AA522507 AI950880 AA187460 BE386860 
AW859229 BE170489 BE620149 BE54821S AA316696 AA484426 AIS67740 AA160605 AW939805 AA089573 BE300194 
BE391331 AW975419 H26808 BE545544 BE615974 AW800241 BE616222 W17343 BE387865 T53697 C03943 BE617637 
BE315130 T52942 T50588 N74693 M187107 T59919 AW797397 AA206447 AA854619 T57175 A1570295 AW517964 
AA158269 AI282220 W25297 AI580710 BE262453 A1185868 AA526485 AI288051 AIS82513 AA100675 AW615567 
BE395354AI472725BE314881 BE621281 N99921 AI282689 AI432725 AW732011 AA872254BE205807 T59435AI282712 
AA650505 AI004374 AA7252B0 BE313161 T60173 AI371260 BE385641 AW751812 AA078827 AI491858 AI433622 
AA219118 AI002092 AA996003 M064604 AI250287 AI304397 AI453213 AA653630AI524573 AI440306 H48802 AA157843 
AA71S629 AW973788 AA932493 AI347563 AA1 81 309 T67880 M643033 AW467498 AA1 15904 AA93541 0 AA483032 
AA084568 W25246 AI567588 AA155732 AA158614 AA888319 AA158568 AA188422 AI309183 AA084817 M157995 
AI859659 AA188008 AI287379 AI540675 AA085212 AW028391 AA173297 BE256792 AA182854 BE378771 BE538571 
AA079037 BE281597 M643926 W81011 AA159344 AA320691 AA877S97 T57107 AW263819 AI690413 AI619605 AI687579 
AA970560 AI368942 AI927104 AW419220 AI620051 AA1 28490 AA1 20825 M079520 M199648 AW188403 BE045224 
AW265533 M074338 AA102685 AW779399 AA192451 M1 82771 AW366812 BE281418 AA21 1094 M131073 M487924 
AW674848 AI568103 M171934 F30349 AW088785 AA581370 AA205482 AW352296 AW517565 AI376249 M158884 
AI340509 T59965 M085193 M071570 AI874045 M852755 BE045217 AW189428 AA21 1 141 M652134 AI497729 
M994817AI811459BE535857AW769697AW167892AW149305AI864981AW272126AW023245AI439266AI953196 
M160912 AI718560 BE537547 AA501448 M069308 L07393 M3530O7 M079235 AI539140 M740154 W58341 AA888403 
BE299000 M196413 BE613327 BE261523 M866599 AW844713 AI691159 AI079975 AW327479 BE180731 M984805 

303701 1155179J AW500732 AW504061 

303759 447287J M774672 AW504164 

303773 356632 J M769074AA570769 AA808585 AA808682 

303778 174437J AW505368 AA21B610 F11852 T65345 AA397806 

303784 414659J BE29771 1 AW505574 M704983 

303845 5021 1_2 F07942 T08033 

303898 162688 3 BE386266BE148823 T23215AI906290AA299906BE207197AW074114AI760368AI005358AW662201 M188988 

AI690711 M775103AW072931 AI684269AW1 29364 AW61 5634 AI049941 AW874040 AI352633 M1 88989 AI287775 
AA868774AA599660 

20121 452027J M780365 M909233 AI275542 

319590 171338J AA210878 M215684R11101 

305186 17456J M13560 AA336951 M161015R72814 T69687 R75705T61319M158454 R50579T56649 AI214156T70375 R31655 

H64997 AW800487 H49110 M634206 H42384 H21783 AI560152 M664230 H42302 R48708 M013277 T61901 T92417 
AA875985 T61962 T63055 AA430725 M458964 M578746 AI582385 T63000 AI499875 H64998 A/ "" 
AI865211 AI439714 AI224059 AI249917 T59258 AA477806 M715834 M916120 R38304 R35899 Rf 
AW516728 T54642 M079866 H27555 AA455820T63919 R79450 AI431241 M937349 M127213 M421729 H61196 
T63894 M013050 M079133 W96364 AA487926 AI762796 H26377 AI433386 AI855423 AW371475 R98189 M643978 
AI718204AW381954AI862735 

319638 226485 J AA323758 R1 2731 R1 4082 

320257 163534J R17531 AW960899 M338366 AW673294 BE047729 BE047722 M330746 AW841 797 H05030 AI142105 R12654 

320289 115941J H07989 AJ239462 H24544 M078369 R74153 

304703 3397L42 BE512926 BE304794 M129140 M052922 M092258 BE378058 BE615391 BE615218 BE616188 AI214126 H05675 

W56857 AI028525 BE617241 BE531271 AW856227 T56489 M322005 AW794148 AF170577 BE615738 AA005138 L76930 
L76932 L76933 X95410 AW389462 BE563092 AW997937 AA263158 AI520992 AW947350 AA522535 AW945921 AV653776 
AW884835 AW947338 AI687178 AW945799 AI905627 AW948449 AV653751 AW945924 AA563898 AW945810 AW945832 
AW371449 AW945864 AW948447 AW945910 M643002 AA522680 AA522715 AA578840 AA523279 AA826150 AW945809 
AW405998 AA551909 R23173 AA595545 AW389497 AI933770 AI125053 AI471803 AW795856 AW796937 W30675 H70317 
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306051 19085_3 



321163 171122J 
321235 1102181J 
320603 4297J 



320641 185591J 

320651 58648.1 

321325 28266J 

305704 464759_-1 

322011 23158J 

306407 

306454 

306516 

306518 



H68296 T59240 AA397650 H59852 M938072 AA978010 R35643 T89735 AW361585 AW196153 AI538069 M604540 
A1434259 R49181 T58717AW062486 AW796966 AI648384 R77733 AI623502 BE171342 BE171303 R35658 AW974883 
AW1 49898 AI500045 AI540710 AI540392 AW009172 AW277199 AI371312 AI500096 AI470297 AW372940 AW844562 
AW844560 AW797965 AI691146 X07062 AW799199 H60666 AA837684 AF130734 T25952 AI933771 AI914860 AW391925 
AW793843 AW795012 AW366709 AW750987 AW750985 R35765 AW844942 AW750986 H64920 R34651 X86703 
BE018103 BE018083 BE293253 AW247033 BE207643 BE514793 BE183233 AA376427 AW273850 AW043786 BE439973 
AL045428 AI889050 AA026496 AI422924 AI8B4435 W96068 AA020872 F371 1 9 AA71 4378 AA021 1 07 AA01 1 141 AI554001 
AI375841 AI469097 AA335219 AW967315 AI692177 AA410448 AI568858 AA582647 AA026419 AA281639 AW515248 
AW007777 AA010840 AW188439 AI805423 AI148210 BE301590 AA744414 AA745392 AW167423 AA622659 AW000878 
AI432387 AA760930 BE047189 AA021 605 AV658045 AI093347 AA588594 H63143 M639556 AI308976 AA379270 
AA633407 AI874329 AI206484 AI493895 AI694103 AI249682 AA973765 AA872445 AI125446 AA287272 AW069761 
AA682569 AW009712 BE542774 R50167 BE301574 AA991202 AA502006 AI219819 AW074373 AA617996 AI521242 F25241 
AW615812 R16774 AA335218 AW673800 H26778 AI468557 AI886986 AI560759 AI460075 AA502968 AA503273 AA610680 
AA287274 AA554020 AA284889 AA916636 AW469457 AW273250 AW673708 AW512948 AL041071 AI446042 AA903535 
BE172441 AI282411 AW265021 M810799 AI559865 AA729332AW004611 AW129451 AA659019 BE208239 M610825 
H0351 1 BE383995 R1 6474 AA281701 AW009244 AA287424 AA558139 AW364081 

F08147 AW408359 AW949429 R23785 AW247442 AA305512 T29095 AA9O5130 BE246361 BE244981 AA220199 BE504058 
X80878 AA533727 M608601 AWO05964 AI811627 AI367037 AI277985 AI493719 AI277848 AAB54982 AW247298 AI216345 
AI041295 A1887378 M781241 AI674270 AW628959 AI383083 BE504391 AA729421 AA552188 AA373387 AW880360 
AW875262 AW875369 AW581540 AW875358 AW581568 R23735 AW134768 
W03912 AW971410 AA506385 AA209530 H73495 H48629 W56149 
H56752 AW340384 N49521 

AA853680 AK001668 BE386425 BE563549 BE296124 BE298950 R51419 U46295 BE147292 AA360056 R48018 AW845348 
N47383 AI817280 AI671902 AA988104 AA479464 N56996 AI192374 AI927558 AA659888 AI799903 AA548397 AI161167 
AI656333 A1418829 AW592671 BE327906 AW513346 AI888579 AW469410 AW512809 D25682 AA576079 AA479354 
T30342 R51307 T16044 H29063 AW079357 AI339477 R47914 AI985068 AI870065 AI868489 AI521099 AI582732 AA995540 
AW957299 AA352608 AA676752 AA410510 AA358874 AI865724 M853679 AI699265 AW188789 N47380 AA233715 



AW886407 AA489268 R57015 R58094 BE077459 BE077423 BE546995 AW849216 T69383 AW9381 11 H60337 BE221073 
AB033100 AA347036 BE260325 AW961669 AL047207 AA347037 A1766894 AA601045 AI559897 AW139033 AW274622 
AW172884 AW089070 AA804340 AW798925 



306805 
303814 
306873 



AI001149 
AI003654 
AI041589 
AI051696 
AI452732 
AI470948 
AI475914 
AI055966 
AI066577 
AI086929 
AI095365 
AI127883 
AI559492 
AI565612 
AI571211 



308511 
308601 
308612 



AI687580 
AI719930 
AI735634 
AI744063 



310570 1071946J 

305022 

305060 
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305079 AA641329 

305134 AA653159 

303977 AW512978 

305216 AA669056 

305263 AA679467 

305266 AA679772 

305396 AA721052 

305403 AA723748 

305488 M749000 

305549 AA773530 

305601 AA780975 

305610 M782319 

305621 M789095 

305710 AA826544 

305724 AA827608 

305744 AA831819 

305752 AA835278 

307018 AI140639 

307055 AI148477 

307058 AI148709 

305801 AA845997 

305830 M857665 

305836 AA858043 

305852 AAB62455 

305858 M863103 

305866 AA864533 

305867 MB64572 
307126 AI184951 
305903 AA873085 
328803 c_7_hs 

328809 c_7_hs 
305949 AA884409 
328829 c_7 hs 
330021 Cl6_p2 
330024 c16_p2 
330028 c16_p2 
330049 c17_p2 
305993 AA889197 

330095 c19_p2 

330096 c19j)2 

307205 AI192479 

307427 AI243437 

307491 AI268539 

307581 AI284415 

307588 AI285535 
337672 CH22_6002FG_UNK_EM:AC00 
337693 CH22_6030FG_LINK_EM:AC00 
337738 CH22_6083FG_UNK_EM:AC00 

307692 AI318342 

307806 AI351739 

309107 AI925823 

309230 AI970747 
339338 CH22_8300FG_LINK_BA354I1 

309257 AI984183 

309366 AW072970 

309422 AW087175 
325207 c10_hs 
325257 c11_hs 

309646 AW194694 

309651 AW195850 
325313 c11_hs 

309924 AW340812 
334030 CH22_1308FQ_320_2_L1NK_EM 
334040 CH22_1318FG_322_8_LINK_EM 
334083 CH22_1361FG_327_38_LiNK^E 
332810 CH22_26FG_7_12_UNK_C65E1 

302747 32813J AF062275 L03830 

302753 33029J M74299 M74302 M74303 

302777 33803J AJ230640 AJ230648 
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304094 


H11295 


302824 35372J 


1121 260 U21258 


302896 41196J 


AF054663AF124197 R70292 


325B70 c16_hs 




304240 


AA009802 


304410 


AA284508 


304443 


AA399444 


304475 


AA428879 


304522 


AM65405 


304678 


AA548556 


304705 


AA564064 


306004 


AA869992 


306008 
306013 


AA894390 
AA896990 


336174 CH22_3567FG_710_1_LINK_DA 


306094 


AA908877 


304823 


AA584837 


304872 


AA595289 


304918 


AA602697 


304955 


AA613504 


306249 


AA933B40 
AA936892 


306295 


AA937331 


306317 


AA947909 


306347 


AA961144 
AA962086 


306365 
30639B 


AA970548 


330401 entrezD28! 
330463 460_2 


383 D28383 

NM 001055 AA332948 U26309 U09031 L19955 L10819 A1366043 X84654 U71086 AV654451 AJ00741 8 M053625 


BE168856 M376730 H12694 AA810348 AA621972 AI818950 AV645367 AI819966 AA910602 AW512449 H67893 AI310497 




AI304330 AI339217 AW193588 AW438688 AI818970 AW316799 AA906527 AA777570 N47673 AI336428 AW945133 




AI038606 R29692 AW194197 AI304748 H12639 M053178 M493213 AA676958 AA113154 AI313469 AI368239 R93183 




W24532 U52852 U54701 AL046864 AA365795 



330535 1374_-8 U11872 

332634 10404_2 U24488NM_007116 
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TABLE 13B shows the genomic positioning for those primekeys lacking unigene ID's and 
accession numbers in Table 13. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



332791 Dunham, I. eUl. Plus 

332792 Dunham, I. et.al. Plus 
332810 Dunham, I. etal. Plus 
332944 Dunham, I. et.al. Plus 
332972 Dunham, I. etal. Plus 
333133 Dunham, I. etal. Plus 

333154 Dunham, I. etal. Plus 

333155 Dunham, I. etal. Plus 
333227 Dunham, I. etal. Plus 
333230 Dunham, I. etal. Plus 
333298 Dunham, I. etal. Plus 
333304 Dunham, I. etal. Plus 



333365 Dunham, I. etal. Plus 

333383 Dunham, I. etal. Plus 

333391 Dunham, I. etal. Plus 

333392 Dunham, I. etal. Plus 
333397 Dunham, I. etal. Plus 
333403 Dunham, I. etal. Plus 
333413 Dunham, I. etal. Plus 
333445 Dunham, I. etal. Plus 
333479 Dunham, I. etal. Plus 
333481 Dunham, I. etal. Plus 
333483 Dunham, I. etal. Plus 

333516 Dunham, I. etal. Plus 

333517 Dunham, I. etal. Plus 

333518 Dunham, I. etal. Plus 
333531 Dunham, I. etal. Plus 
333566 Dunham, I. etal. Plus 
333572 Dunham, I. etal. Plus 
333586 Dunham, I. etal. Plus 
333588 Dunham, I. etal. Plus 

333594 Dunham, I. etal. Plus 

333595 Dunham, I. etal. Plus 

333600 Dunham, I. etal. Plus 

333601 Dunham, I. et.al. Plus 
333607 Dunham, I. et.al. Plus 

333612 Dunham, I. etal. Plus 

333613 Dunham, I. etal. Plus 

333614 Dunham, I. et.al. Plus 
, I. etal. Plus 



n, I. etal. Plus 4630338-4630645 



333637 Dunham, I. etal. Plus 

333642 Dunham, I. etal. Plus 

333647 Dunham, I. etal. Plus 

333653 Dunham, I. etal. Plus 

333654 Dunham, I. etal. Plus 

333656 Dunham, I. etal. Plus 

333657 Dunham, I. etal. Plus 



m, I. etal. Plus 6614174-6614467 
m, Let 



6708760-6709139 
6772502-6772779 
6811130-6811392 



, I. etal. Plus 6835282-6835474 
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333659 Dunham, I. etal. Plus 

3336B4 Dunham, I. etal. Plus 

333686 Dunham, I. et.al. Plus 

333697 Dunham, I. etal. Plus 

333698 Dunham, I. etal. Plus 

333699 Dunham, I. et.al. Plus 
333703 Dunham, I. et.al. Plus 
333709 Dunham, I. etal. Plus 
333747 Dunham, I. etal. Plus 

333774 Dunham, I. etal. Plus 

333775 Dunham, I. etal. Plus 
333806 Dunham, I. etal. Plus 
333843 Dunham, I. etal. Plus 
333854 Dunham, I. etal. Plus 
333873 Dunham, I. etal. Plus 
333880 Dunham, I. etal. Plus 
333885 Dunham, I. etal. Plus 
333918 Dunham, I. etal. Plus 
333947 Dunham, I. etal. Plus 
333961 Dunham, I. etal. Plus 
333981 Dunham, I. etal. Plus 
333991 Dunham, I. etal. Plus 
333994 Dunham, I. etal. Plus 
334030 Dunham, I. etal. Plus 
334083 Dunham, I. etal. Plus 
334111 Dunham, I. etal. Plus 
334135 Dunham, I. etal. Plus 
334218 Dunham, I. etal. Plus 
334249 Dunham, I. etal. Plus 
334262 Dunham, I. etal. Plus 
334264 Dunham, I. etal. Plus 

334327 Dunham, I. etal. Plus 

334328 Dunham, I. etal. Plus 
334340 Dunham, I. etal. Plus 
334454 Dunham, I. etal. Plus 
334504 Dunham, I. etal. Plus 
334508 Dunham, I. etal. Plus 
334512 Dunham, I. etal. Plus 
334582 Dunham, I. etal. Plus 
334659 Dunham, I. etal. Plus 
334721 Dunham, I. etal. Plus 
334723 Dunham, I. etal. Plus 
334730 Dunham, I. etal. Plus 
334774 Dunham, I. etal. Plus 
334778 Dunham, I. etal. Plus 
334851 Dunham, I. etal. Plus 
334885 Dunham, I. etal. Plus 
334902 Dunham, I. etal. Plus 

334905 Dunham, I. etal. Plus 

334906 Dunham, I. etal. Plus 
334910 Dunham, I. etal. Plus 
335018 Dunham, I. etal. Plus 
335025 Dunham, I. etal. Plus 
335033 Dunham, I. etal. Plus 
335044 Dunham, I. etal. Plus 
335142 Dunham, I. etal. Plus 
335157 Dunham, I. etal. Plus 
335160 Dunham, I. etal. Plus 
335174 Dunham, I. etal. Plus 
335188 Dunham, I. etal. Plus 

335190 Dunham, I. etal, Plus 

335191 Dunham, I. etal. Plus 
335193 Dunham, I. et.al. Plus 
335204 Dunham, I. etal. Plus 
335222 Dunham, I. etal. Plus 

335226 Dunham, I. etal. Plus 

335227 Dunham, I. etal. Plus 

335309 Dunham, I. etal. Plus 

335310 Dunham, I. etal. Plus 



6836179-6836248 
7169561-7169742 
7177117-7177302 



7716509-7716636 



9837016-9837081 

10279365-10279531 

10457085-10457183 

12680289-12680378 

13190430-13190574 



13642407-13642522 



15795816-15796987 
15805317-15805399 
15967830-15967934 
16251857-16252178 
16276180-16276395 
17820110-17820810 
19233667-19233787 
19317083-19317195 



20743941-20744050 



21465105-21465186 
21543302-21544341 
21573388-21573497 
21631301-21631447 
21669118-21669328 
21680807-21680876 
21681110-21681183 
21692208-21692362 
21750636-21750726 
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335311 Dunham, I. eUl. 


Plus 


22501602-22501676 
22779222-22779516 


335355 Dunham, I. etal. 
335362 Dunham, I. etal. 


Plus 
Plus 


22809167-22809461 


335363 Dunham, 1. etal. 


Plus 


22843040-22843184 


335384 Dunham, 1. etal. Plus 


22918150-22918263 


335385 Dunham, 1. etal. Plus 


22919072-22919339 


335436 Dunham, 1. etal. Plus 


23427793-23427923 


335440 Dunham,!. etal. Plus 


23458702-2345901 7 


335441 Dunham, 1. etal. 


Plus 


23460632-23460724 


335450 Dunham, 1. etal. 


Plus 


2348D1 90-23480270 


335453 Dunham, 1. etal. 
335458 Dunham, 1. etal. 


Plus 
Plus 


23483333-23483459 
23490034-23490143 


335464 Dunham, 1. etal. 


Plus 


23500331-23500496 


335496 Dunham, 1. etal. 


Plus 


24164386-24164545 


335497 Dunham, 1. etal. 


Plus 


241 67666-24 1 67869 


335493 Dunham, 1. etal. 


Plus 


241 72082-24172161 


335499 Dunham,!. etal. 


Plus 


241 76698-24176869 


335500 Dunham,!. etal. Plus 


24178236-24178326 


335507 Dunham,!. etal. Plus 


242 1 9973-24220039 


335510 Dunham, 1. etal. Plus 


24222975-24223118 


335513 Dunham,!. etal. 


Plus 


24224272-24224496 


335627 Dunham,!. etal. 


Plus 


25150005-25150061 


335651 Dunham,!, etal. 


Plus 


25317560-25317696 


335655 Dunham, 1. etal. 


Plus 


25333211-25333369 


33565S Dunham, 1. etal. 


Plus 


25333601-25333751 


335653 Dunham,!. etal. 


Plus 


25336315-25336406 


335663 Dunham, 1. etal. 


Plus 


25342680-25342802 


335665 Dunham, 1. etal. 


Plus 


25344096-25344287 


335667 Dunham, 1. etal. 


Plus 


25345735-25345B56 


335668 Dunham, 1. etal. 


Plus 


25346313-25346447 


335689 Dunham, 1. etal. 


Plus- 


25454350-25454604 


33569D Dunham, 1. etal. 


Plus 


25455442-25455625 


335715 Dunham, 1. etal. 


Plus 


25565941-25566052 


335719 Dunham, 1. etal. 


Plus 


25593936-25594101 


335734 Dunham, 1. etal. 


Plus 


25688723-25688869 


335744 Dunham, 1. etal. 


Plus 


25716483-25716615 


335809 Dunham,!. etal. 


Plus 


26310772-26310909 


335819 Dunham, 1. etal. Plus 


26356341-26356470 


335822 Dunham, I. etal. 


Plus 


263S4087-26364196 


335872 Dunham, 1. etal. 


Plus 


26820760-26820943 


335885 Dunham, 1. etal. 
335968 Dunham, 1. etal. 


Plus 
Plus 


26933436-26933534 
27743843-27744029 


335971 Dunham, 1. etal. 


Plus 


27752808-27753017 


335975 Dunham, 1. etal. 


Plus 


27801321-27801391 


335976 Dunham, 1. etal. 


Plus 


27809041-27809187 


335989 Dunham, 1. etal. 


Plus 


27983788-27983860 


335990 Dunham,!. etal. 


Plus 


27988532-27988608 


336010 Dunham,!. etal. 


Plus 


28570239-28570330 


336093 Dunham,!. etal. 


Plus 


29556922-29557002 


336126 Dunham,!. etal. 


Plus 


30057891-30058105 


336129 Dunham,!. etal. 


Plus 


30062259-30062348 


336187 Dunham,!. etal. 


Plus 


30433494-30433585 


336188 Dunham, I. etal. 


Plus 


30434870-30435004 


336225 Dunham, I. etal. 


Plus 


30833614-30833788 


336371 Dunham,!, etal. 


Plus 


33968108-33968204 


336373 Dunham, I. etal. 


Plus 


33976308-33976504 


336377 Dunham,!. etal. 


Plus 


33994489-33994599 


336380 Dunham, 1. etal. 

336383 Dunham, 1. etal. 

336384 Dunham, 1. etal. 


Plus 
Plus 
Plus 


33995323-33995434 
34005784-34005964 
34007429-34007559 


336385 Dunham, 1. etal. 

336386 Dunham, 1. etal. 


Plus 
Plus 


34007879-34008159 
34012965-34013115 


336441 Dunham, I. etal. 


Plus 


34187606-34187663 


336444 Dunham, I. etal. 


Plus 


34190585-34190718 


336484 Dunham,!. etal. Plus 


34237425-34237505 


336497 Dunham, I. etal. 


Plus 


34267190-34267245 


336499 Dunham, I. etal. 


Plus 


34267504-34267572 


336503 Dunham,!. etal. Plus 


34271306-34271372 


336548 Dunham,!. etal. Plus 


34353881-34354826 
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336552 Dunham, I. et.al. Plus 

336553 Dunham, I. et.al. Plus 
3365S7 Dunham, t. et.al. Plus 
336568 Dunham, I. eUl. Plus 

5 336659 Dunham, I. et.al. Plus 

336715 Dunham, I. et.al. Plus 

336803 Dunham, I. et.al. Plus 

336805 Dunham, I. etal. Plus 

336850 Dunham, I. etal. Plus 

10 336857 Dunham, I. etal. Plus 

336911 Dunham, I. etal. Plus 

336949 Dunham, I. etal. Plus 

336950 Dunham, I. etal. Plus 
336958 Dunham, I. etal. Plus 

15 336993 Dunham, I. etal. Plus 

337076 Dunham, I. etal. Plus 

337109 Dunham, I. etal. Plus 

337123 Dunham, I. etal. Plus 

337151 Dunham, I. etal. Plus 

20 337189 Dunham, I. etal. Plus 

337241 Dunham, I. etal. Plus 

337337 Dunham, I. etal. Plus 

337353 Dunham, I. etal. Plus 

337384 Dunham, I. etal. Plus 

25 337396 Dunham, I. etal. Plus 

337414 Dunham, I. etal. Plus 

337418 Dunham, I. etal. Plus 

337461 Dunham, I. etal. Plus 

337480 Dunham, I. etal. Plus 

30 337482 Dunham, I. etal. Plus 

337483 Dunham, I. etal. Plus 

337490 Dunham, I. etal. Plus 

337522 Dunham, I. etal. Plus 

337532 Dunham, I. etal. Plus 

35 337552 Dunham, I. etal. Plus 

337584 Dunham, I. etal. Plus 

337611 Dunham, I. etal. Plus 

337672 Dunham, I. etal. Plus 

3376S3 Dunham, I. etal. Plus 

40 337738 Dunham, I. etal. Plus 

337926 Dunham, I. etal. Plus 

337927 Dunham, 1. etal. Plus 
337935 Dunham, I. etal. Plus 
337944 Dunham, I. etal. Plus 

45 337954 Dunham, I. etal. Plus 

337996 Dunham, I. etal. Plus 

338004 Dunham, I. etal. Plus 

338016 Dunham, I. etal. Plus 

338174 Dunham,!. etal. Plus 

50 338176 Dunham, I. etal. Plus 

338238 Dunham, I. etal. Plus 

338277 Dunham, I. etal. Plus 

333294 Dunham, I. etal. Plus 

338316 Dunham, I. etal. Plus 

55 338323 Dunham, I. etal. Plus 

338324 Dunham, I. etal. Plus 

338386 Dunham, I. etal. Plus 

338398 Dunham, I. etal. Plus 

338410 Dunham, I. etal. Plus 

60 338414 Dunham, I. etal. Plus 

338460 Dunham, I. etal. Plus 

338481 Dunham, I. etal. Plus 

338489 Dunham, I. etal. Plus 

338500 Dunham, I. etal. Plus 

65 338514 Dunham, I. etal. Plus 

338530 Dunham, I. etal. Plus 

338620 Dunham, I. etal. Plus 

338631 Dunham, I. etal. Plus 

338653 Dunham, I. etal. Plus 



34356420-34356527 

34356683-34356753 

34428228-34428395 

34428521-34428637 

1896402-1896478 

3110198-3110314 

6106904-6106990 

6126661-6126786 

7745284-7745355 

8130457-8130612 

11035818-11035984 

12818687-12818891 

12875843-12875912 

13203550-13203973 

15096270-15096324 

19338177-19338679 

21166580-21166650 

22052874-22052942 

23106433-23106510 

24225887-24225954 

27280182-27280313 

30395182-30395285 

30804624-30804780 

31333399-31333580 

31585902-31586067 

31953012-31953205 

32014049-32014131 

32803968-32804028 

33219714-33219779 

33227865-33227946 

33237292-33237427 

33318571-33318644 

33963188-33963979 

34187269-34187366 

19497-19600 

945236-945452 

1482883-1483016 

3331236-3331313 

3575975-3576153 

3865738-3865814 

6286377-6286470 

6343033-6343172 

6534661-6534782 

6589383-6589450 

6831483-6831620 

7445532-7445633 

7601363-7601520 

7863131-7863310 

12771102-12771268 

12774072-12774223 

14661936-14662015 

16167622-16167962 

16463958-16464539 

17089711-17089988 

17154655-17154792 

17155309-17155574 

1851 1213-1861 1407 

18953492-18953581 

19292807-19292916 

19345573-19345660 

20233372-20233488 

20942659-20942873 

21142605-21143049 

21253847-21253974 

21379420-21379655 

21636361-21636509 

23540239-23540334 

23711167-23711241 

24219427-24219509 
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338660 Dunham, I. et.al. Plus 24387122-24387266 

338704 Dunham, I. et.al. Plus 25230432-25230548 

338847 Dunham, I. et.al. Plus 27995337-27995420 

3388B7 Dunham, I. etal. Plus 28465244-28465384 

5 338895 Dunham, I. etal. Plus 28598893-28599135 

338915 Dunham, I. etal. Plus 28624881-28824977 

338925 Dunham, I. etal. Plus 28883892-28884036 

338936 Dunham, I. etal. Plus 29148022-29148160 

338952 Dunham, I. etal. Plus 29418831-29418968 

10 338980 Dunham, I. etal. Plus 29896789-29896874 

338981 Dunham, I. etal. Plus 29897917-29898008 

338986 Dunham, I. etal. Plus 30007287-30007415 

339009 Dunham, I. etal. Plus 30343477-30348598 

339017 Dunham, I. et.al. Plus 30420896-30421090 

15 339045 Dunham, I. etal. Plus 30744286-30744356 

339046 Dunham, I. etal. Plus 30746269-30746420 

339059 Dunham, I. etal. Plus 30814655-30814801 

339067 Dunham, I. etal. Plus 30869347-30869412 

339069 Dunham, I. etal. Plus 30880975-30881070 

20 339078 Dunham, I. etal. Plus 30914310-30914423 

339084 Dunham, I. etal. Plus 30944556-30944803 

339101 Dunham, I. etal. Plus 31158047-31158123 

339102 Dunham, I. etal. Plus 31169321-31169563 
_ 339103 Dunham, I. etal. Plus 31170343-31170454 

25 339115 Dunham, I. etal. Plus 31459869-31459927 

339157 Dunham, I. etal. Plus 32131701-32131833 

339166 Dunham, I. etal. Plus 32210902-32211006 

339167 Dunham, I. etal. Plus 32213567-32213730 
339288 Dunham, I. etal. Plus 33169611-33169691 

30 339289 Dunham, I. etal. Plus 33186756-33186903 

339291 Dunham, I. etal. Plus 33205057-33205247 

339407 Dunham, I. etal. Plus 34189461-34189620 

332865 Dunham, I. etal. Minus 1391482-1391218 

332881 Dunham, I. etal. Minus 1563520-1563184 

35 332930 Dunham, I. etal. Minus 2022565-2022497 

332931 Dunham, I. etal. Minus 2023651-2023562 

332984 Dunham, I. etal. Minus 2632606-2632457 

332986 Dunham, I. etal. Minus 2635398-2635206 

332997 Dunham, I. etal. Minus 2710509-2710375 

40 333051 Dunham, I. etal. Minus 2991973-2991840 

333061 Dunham, I. etal. Minus 3029631-3029527 

333064 Dunham, I. etal. Minus 3030722-3030623 

333096 Dunham, I. etal. Minus 3184234-3184118 

333099 Dunham, I. etal. Minus 3206796-3206674 

45 333106 Dunham, I. etal. Minus 3230744-3230547 

333160 Dunham, I. etal. Minus 36548933654678 

333163 Dunham, I. etal. Minus 3665124-3654962 

333165 Dunham, I. etal. Minus 3674052-3673905 

333166 Dunham, I. etal. Minus 3694664-3694567 
50 333170 Dunham, I. etal. Minus 3733394-3733299 

333174 Dunham, I. etal. Minus 3764284-3764210 

333188 Dunham, I. etal. Minus 3826990-3826863 

333214 Dunham, I. etal. Minus 3966559-3966437 

333232 Dunham, I. etal. Minus 4001551-4001365 

55 333237 Dunham, I. etal. Minus 40033264003219 

333239 Dunham, I. etal. Minus 4095861-4094462 

333255 Dunham, I. etal. Minus 4297883-4297716 

333259 Dunham, I. etal. Minus 4305769-4306639 

333274 Dunham, I. etal. Minus 43891464388954 

60 333290 Dunham, I. etal. Minus 4530734-4530554 

333295 Dunham, I. etal. Minus 45492904549198 

333296 Dunham, I. etal. Minus 4550766-4550644 

333310 Dunham, I. etal. Minus 4637315-4637232 

333311 Dunham, I. etal. Minus 4637933-4637844 
65 333312 Dunham, I. etal. Minus 46387944638635 

333313 Dunham, I. etal. Minus 46393974639277 

333315 Dunham, I. etal. Minus 5405980-5405B76 

333318 Dunham, I. etal. Minus 46426364642564 

333321 Dunham, I. etal. Minus 46490804648934 
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333327 Dunham, I. eUL Minus 46579474657828 

333335 Dunham, I. etaL Minus 4672656-4672564 

333337 Dunham, I. etaL Minus 46779304677641 

333454 Dunham, I. etaL Minus 5137007-5136880 

5 333458 Dunham, I. etaL Minus 5143942-5143806 

333459 Dunham, I. etal. Minus 5144548-5144344 

333470 Dunham, I. etal. Minus 5223319-5223088 

333493 Dunham, I. etal. Minus 46373154637232 

333496 Dunham, I. etal. Minus 5404643-5404523 

10 333498 Dunham, I. etal. Minus 5405980-5405876 

333510 Dunham, I. etal. Minus 5557628-5557469 

333546 Dunham, I. etal. Minus 5886643-5886442 

333561 Dunham, I. etal. Minus 5903659-5903590 

333738 Dunham, I. etal. Minus 7552160-7552084 

15 333780 Dunham, I. etal. Minus 7750367-7750277 

333783 Dunham, I. etal. Minus 7751850-7751777 

333818 Dunham, I. etal. Minus 7911959-7911762 

333894 Dunham, I. etal. Minus 8188855-8188709 

333897 Dunham, I. etal. Minus 8194390-8194284 

20 333900 Dunham, I. etal. Minus 8200268-8200122 

333909 Dunham, I. etal. Minus 8229639-8229477 

333936 Dunham, I. etal. Minus 8512805-8512564 

333944 Dunham, I. etal. Minus 8557051-8556936 

334040 Dunham, I. etal. Minus 9342995-9342934 

25 334154 Dunham, I. etal. Minus 10570714-10570572 

334178 Dunham, I. etal. Minus 11755052-11754971 

334188 Dunham, I. etal. Minus 11925963-11925834 

334273 Dunham, I. etal. Minus 13265608-13265522 

334282 Dunham, I. etal. Minus 13285293-13265178 

30 334285 Dunham, I. etal. Minus 13289990-13289793 

334286 Dunham, I. etal. Minus 13291759-13291569 

334303 Dunham, I. etal. Minus 13454331-13454217 

334305 Dunham, I. etal. Minus 13456310-13456209 

334306 Dunham,!. etal. Minus 13461157-13461049 
35 334320 Dunham, I. etal. Minus 13496857-13496717 

334352 Dunham, I. etal. Minus 13675908-13675828 

334353 Dunham, I. etal. Minus 13683722-13683596 
334359 Dunham, I. etal. Minus 13728664-13728534 
334363 Dunham, I. etal. Minus 13740004-13739812 

40 334365 Dunham, I. etal. Minus 13742078-13741971 

334399 Dunham, I. etal. Minus 14186289-14186163 

334409 Dunham, I. etal. Minus 14195181-14195075 

334414 Dunham, I. etal. Minus 14234033-14233932 

334470 Dunham, I. etal. Minus 14389581-14389442 

45 334483 Dunham, I. etaL Minus 14428355-14428281 

334489 Dunham, I. etaL Minus 14455428-14454288 

334498 Dunham, I. etal. Minus 14483789-14483700 

334501 Dunham, I. etal. Minus 14487509-14487356 

334502 Dunham, I. etal. Minus 14488605-14488526 
50 334543 Dunham, I. etal. Minus 14834496-14834116 

334622 Dunham, I. etal. Minus 15191678-15191609 

334650 Dunham, I. etal. Minus 15371251-15371178 

334680 Dunham, I. etal. Minus 15520047-15519887 

334745 Dunham, I. etal. Minus 16049960-16049653 

55 334756 Dunham, I. etal. Minus 16128678-16128528 

334758 Dunham, I. etal. Minus 16132368-16132233 

334761 Dunham, I. etal. Minus 16138424-16138319 

334763 Dunham,]. etal. Minus 16148135-16148077 

334784 Dunham, I. etal. Minus 16294548-16294360 

60 334790 Dunham, I. etal. Minus 16307576-16307509 

334793 Dunham, I. etal. Minus 16330748-16330681 

334802 Dunham, I. etal. Minus 16413158-16413026 

334820 Dunham, I. etal. Minus 16764338-16764249 

334824 Dunham, I. etal. Minus 16857777-16857674 

65 334832 Dunham, I. etal. Minus 17173957-17173760 

334842 Dunham, I. etal. Minus 17464352-17464181 

334844 Dunham, I. etal. Minus 17503891-17503768 

334857 Dunham, I. etal. Minus 18488368-18488242 

334927 Dunham, I. etal. Minus 19988711-19987853 
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334939 Dunham, 1. et.al. 


Minus 


20131162-20131054 


334951 Dunham, 1. et.al. 




20147703-20147502 


3349S9 Dunham, 1. et.al. 




20188176-20188020 


334972 Dunham, 1. et.al. 


Minus 


20294734-20294611 


335050 Dunham, 1. et.al. 


Minus 


20884109-20883951 


335078 Dunham, l.etal. 


Minus 


21059529-21059458 


335102 Dunham, I. etal. 


Minus 


21313841-21313598 


335105 Dunham, I. etal. 


Minus 


21320563-21320440 


335110 Dunham, I. etal. 


Minus 


21334136-21333811 


335111 Dunham, I. etal. 


Minus 


21335946-21335809 


335115 Dunham, I. etal. 


Minus 


21388250-21388146 


335116 Dunham, I. et.al. 


Minus 


21388573-21388414 


335185 Dunham, 1. etal. 


Minus 


21651593-21651522 


335186 Dunham, I. etal. 


Minus 


21656436-21656338 


335230 Dunham, I. etal. 




21899517-21898678 


335236 Dunham, I. etal. 


Minus 


21915016-21914870 


335243 Dunham, I. etal. 


Minus 


21933519-21933365 


335249 Dunham, I. etal. 


Minus 


21950851-21950669 


335258 Dunham, I. etal. 


Minus 


22043431-22043262 


335261 Dunham, 1. etal. 


Minus 


22063937-22063772 


335276 Dunham, 1. etal. 


Minus 


22154036-22153937 


335279 Dunham, l.etal. 


Minus 


22168834-22168638 


335330 Dunham, l.etal. 


Minus 


22556589-22556422 


335331 Dunham, I. etal. 




22556823-22556708 


335334 Dunham, I. etal. 




22560390-22560136 


335346 Dunham, I. etal. 


Minus 


22641097-22640918 


335349 Dunham, I. etal. 


Minus 


22661861-22661271 


335611 Dunham, I. etal. 


Minus 


25070825-25070706 


335612 Dunham, I. etal. 


Minus 


25072328-25072142 


335671 Dunham, I. etal. 


Minus 


25358629-25358533 


335676 Dunham, I. etal. 


Minus 


25395274-25395152 


335680 Dunham, I. et.al. 


Minus 


25402437-25402361 


335750 Dunham, I. etal. 


Minus 


25732501-25731972 


335752 Dunham, I. etal. 


Minus 


25757026-25756890 


335755 Dunham, I. etal. 


Minus 


25763806-25763747 


335767 Dunham, I. etal. 




25819547-25819218 


335774 Dunham, I. etal. 


Minus 


25883733-25883572 


335777 Dunham, 1. etal. 




25885770-25885599 


335778 Dunham, l.etal. 




25886469-25886334 


335797 Dunham, 1. et.al. 


Minus 


25958182-25958030 


335300 Dunham, l.etal. 


Minus 


25985373-25985280 


335318 Dunham, I. etal. 


Minus 


26323886-26323744 


335834 Dunham, I. etal. 


Minus 


26391707-26391530 


335840 Dunham, l.etal. 


Minus 


26420596-26420538 


335844 Dunham, l.etal. 




26433427-26433344 


335846 Dunham, l.etal. 


Minus 


26436727-26436621 


335856 Dunham, 1. etal. 


Minus 


26662452-26662346 


335887 Dunham, l.etal. 


Minus 


26939225-26938782 


335888 Dunham, l.etal. 


Minus 
Minus 


26943037-26942820 
26946988-26946901 


335889 Dunham, 1. etal. 

335890 Dunham, l.etal. 


Minus 


26949087-26948665 


335893 Dunham, 1. etal. 


Minus 


26973898-26973747 


335895 Dunham, l.etal. 


Minus 


26975307-26975239 


335896 Dunham, 1. etal. 


Minus 


26977639-26977558 


335900 Dunham, l.etal. 


Minus 


26980354-26980238 


335907 Dunham, 1. etal. 


Minus 


27013352-27013273 


335943 Dunham, 1. etal. 




274466 1 0-27446378 


335956 Dunham, 1. etal. 


Minus 


27653729-27653635 


335959 Dunham, l.etal. 


Minus 


27682313-27682145 


335962 Dunham, 1. etal. 


Minus 
Minus 


27704276-27704144 
29036458-29036300 


336040 Dunham, 1. etal. 
336044 Dunham, l.etal. 


Minus 


29043828-29043727 


336047 Dunham, l.etal. 


Minus 


29050617-29050466 


336068 Dunham, l.etal. 


Minus 


29252077-29251969 


336143 Dunham, I. etal. 


Minus 


30135948-30135854 


336158 Dunham, I. etal. 


Minus 


30163730-30163610 


336174 Dunham, I. etal. 


Minus 


30241988-30241839 


336223 Dunham, l.etal. 


Minus 


30816306-3Q816195 


336245 Dunham, l.etal. 


Minus 


31420569-31420509 
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336274 Dunham, I. et.al. Minus 32085468-32085303 

336318 Dunham, I. et.al. Minus 33364452-33364338 

336326 Dunham, I. et.al. Minus 33567328-33567201 

336339 Dunham, I. et.al. Minus 33798479-33798330 

5 336340 Dunham, 1. et.al. Minus 33812069-33811915 

336355 Dunham, I. et.al. Minus 33874750-33874649 

336392 Dunham, I. et.al. Minus 34015868-34015736 

336393 Dunham, I. et.al. Minus 34016145-34015951 

336394 Dunham, I. etal. Minus 34016457-34016298 
10 336400 Dunham, I. etal. Minus 34023437-34023298 

336402 Dunham, I. etal. Minus 34024090-34023981 

336413 Dunham, I. etal. Minus 34046702-34046576 

336424 Dunham, I. etal. Minus 34055549-34055491 

336425 Dunham, I. etal. Minus 34058544-34058446 
15 336437 Dunham, I. etal. Minus 34074154-34074090 

336447 Dunham, I. etal. Minus 34198207-34197996 

336449 Dunham, I. etal. Minus 34204707-34204577 

336466 Dunham, I. etal. Minus 34213195-34213046 

336492 Dunham, I. etal. Minus 34255578-34255437 

20 336511 Dunham, I. etal. Minus 34277480-34277351 

336512 Dunham, I. etal. Minus 34278373-34278275 

336520 Dunham, I. etal. Minus 34319184-34319101 

336522 Dunham, I. etal. Minus 34320169-34320056 

336524 Dunham, I. etal. Minus 34321055-34320921 

25 336527 Dunham, I. etal. Minus 34322071-34321966 

336534 Dunham, I. etal. Minus 34326797-34326620 

336536 Dunham, I. etal. Minus 34327678-34327538 

336542 Dunham, I. etal. Minus 34331316-34331183 

336556 Dunham, I. etal. Minus 34375244-34374907 

30 336557 Dunham, I. etal. Minus 34375443-34375341 

336558 Dunham, I. etal. Minus 34375825-34375698 

336559 Dunham, I. etal. Minus 34376430-34376261 

336560 Dunham, I. etal. Minus 34376814-34376596 

336561 Dunham, I. etal. Minus 34377168-34376928 
35 336597 Dunham, I. etal. Minus 7627912-7627757 

336601 Dunham, I. etal. Minus 13265853-13265654 

336642 Dunham, I. etal. Minus 1304281-1304212 

336645 Dunham, I. etal. Minus 1351268-1351168 

336662 Dunham, I. etal. Minus 2158060-2157993 

40 336664 Dunham, I. etal. Minus 1993558-1993481 

336676 Dunham, I. etal. Minus 2022565-2022497 

336684 Dunham, I. etal. Minus 2158060-2157993 

336686 Dunham, I. etal. Minus 2160698-2160486 

336714 Dunham, I. etal. Minus 3094026-3093871 

45 336719 Dunham, I. etal. Minus 3331631-3331503 

336736 Dunham, 1. etal. Minus 40931284093041 

336744 Dunham, I. etal. Minus 4333001-4332848 

336786 Dunham, I. etal. Minus 5419973-5419873 

336793 Dunham, I. etal. Minus 5631345-5631237 

50 336859 Dunham, I. etal. Minus 8201756-8201561 

336863 Dunham, I. etal. Minus 8396673*396425 

336933 Dunham, I. etal. Minus 11760045-11759981 

336942 Dunham, I. etal. Minus 12027537-12027455 

336960 Dunham, I. etal. Minus 13267243-13267172 

55 336969 Dunham, I. etal. Minus 13725722-13725643 

336971 Dunham, I. etal. Minus 13732308-13732221 

337003 Dunham, I. etal. Minus 15523541-15523422 

337011 Dunham, I. et.al. Minus 16106423-16106080 

337070 Dunham, I. etal. Minus 19034423-19034321 

60 337072 Dunham, I. etal. Minus 19077452-19077323 

337086 Dunham, I. etal. Minus 19657011-19656881 

337140 Dunham, I. etal. Minus 22649450-22649388 

337193 Dunham, I. etal. Minus 24594969-24594874 

337256 Dunham, I. etal. Minus 27659956-27659876 

65 337278 Dunham, I. etal. Minus 28429017-28428848 

337234 Dunham, I. etal. Minus 28491414-28491094 

337293 Dunham, I. etal. Minus 28846334-28845873 

337316 Dunham, I. etal. Minus 29657129-29656997 

337326 Dunham, I. etal. Minus 30017199-30017069 
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337382 Dunham, I. etal. Minus 31233666-31233579 

337392 Dunham, I. etal. Minus 31442311-31442229 

337406 Dunham, I. etal. Minus 31864840-31864588 

337412 Dunham, I. etal. Minus 31916487-31916312 

5 337419 Dunham, I. etal. Minus 32021495-32021170 

337436 Dunham, I. etal. Minus 32257869-32257739 

337455 Dunham, I. etal. Minus 32434517-32434425 

337509 Dunham, I. etal. Minus 33414613-33414498 

337518 Dunham, I. etal. Minus 33793750-33796647 

10 337529 Dunham, I. etal. Minus 34043668-34043548 

337533 Dunham, I. etal. Minus 34193388-34193261 

337539 Dunham, I. etal. Minus 34254490-34254322 

337551 Dunham, I. etal. Minus 34524446-34524362 

337553 Dunham, I. etal. Minus 24230-24160 

15 337591 Dunham, I. etal. Minus 1006414-1006184 

337592 Dunham, I. etal. Minus 1007791-1007634 

337593 Dunham, I. etal. Minus 1009460-1009291 
337607 Dunham, I. etal. Minus 1355719-1355637 
337612 Dunham, I. etal. Minus 1570235-1570142 

20 337635 Dunham, I. etal. Minus 2169690-2169569 

337824 Dunham, I. etal. Minus 45595404559266 

337825 Dunham, I. etal. Minus 45671554567005 
337850 Dunham, I. etal. Minus 5077143-5076943 
337854 Dunham, I. etal. Minus 5153435-5153272 

25 337913 Dunham, I. etal. Minus 6149843-6149786 

337915 Dunham, I. etal. Minus 5922748-5922690 

337968 Dunham, I. etal Minus 7095797-7095680 

338010 Dunham, I. etal. Minus 7754282-7754184 

338012 Dunham, I. etal. Minus 7761421-7761351 

30 338017 Dunham, I. etal. Minus 7864521-7864401 

338065 Dunham, I. etal. Minus 7235048-7234950 

338094 Dunham, I. etal. Minus 9595602-9595440 

338129 Dunham, I. etal. Minus 10915338-10915237 

338132 Dunham, I. etal. Minus 10989617-10989530 

35 338150 Dunham, I. etal. Minus 11478551-11478355 

338157 Dunham, I. etal. Minus 11731444-11731375 

338195 Dunham, I. etal. Minus 13484103-13483972 

338255 Dunham, I. etal Minus 15242294-15242231 

338276 Dunham, I. etal. Minus 16109555-16109398 

40 338431 Dunham, I. etal. Minus 19747608-19747496 

338448 Dunham, I. etal. Minus 20151152-20151054 

338451 Dunham, I. etal. Minus 20174286-20174193 

338477 Dunham, I. etal. Minus 20821897-20821838 

338534 Dunham, I. etal. Minus 21771238-21771170 

45 338682 Dunham, I. etal. Minus 24800712-24800461 

338684 Dunham, I. etal. Minus 24827522-24827428 

338689 Dunham, I. etal. Minus 24893073-24892972 

338695 Dunham, I. etal. Minus 25104153-25104016 

338825 Dunham, I. etal. Minus 27664798-27664712 

50 338842 Dunham, I. etal. Minus 27824238-27824079 

338893 Dunham, I. etal. Minus 28491807-28491631 

338904 Dunham, I. etal. Minus 28766345-28766253 

338935 Dunham, I. etal. Minus 29071537-29071461 

339022 Dunham, I. etal. Minus 30523414-30523289 

55 339034 Dunham, I. etal. Minus 30621603-30621422 

339190 Dunham, I. etal. Minus 32403103-32402985 

339212 Dunham, I. etal. Minus 32494335-32494210 

339213 Dunham, I. etal. Minus 32496590-32496440 
339216 Dunham, I. etal. Minus 32504250-32504109 

60 339233 Dunham, I. etal. Minus 32751331-32751238 

339258 Dunham, I. etal. Minus 32934756-32934615 

339262 Dunham, I. etal. Minus 32971258-32971090 

339263 Dunham, I. etal. Minus 32974634-32974452 
339265 Dunham, I. etal. Minus 32975943-32975806 

65 339338 Dunham, I. etal. Minus 33468726-33468606 

339396 Dunham, I. etal. Minus 34017306-34017205 

339400 Dunham, I. etal. Minus 34045024-34044940 
339425 Dunham, I. etal. Minus 34407911-34407798 
325207 6552430 Plus 140049-140170 
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32956B 3962490 
329517 3983513 
325313 5866865 
325327 5866875 
325317 5866878 
325257 5866895 
329632 6729060 
325371 5866920 
325375 5866920 
325378 5866920 

325469 6017034 

325470 6017034 
325576 6552443 
325505 6682451 
325543 6682452 

329635 5302817 

329636 5302817 
325593 5666992 
325675 5867014 
325704 5867028 
325682 6138923 
325785 6381957 
3256S6 6469822 
325818 6682490 
329777 6002090 
329768 6015501 
329759 6048280 
329731 6065783 
329687 6117855 
329676 6272128 
329667 6272129 

329669 6272129 

329670 6272129 
329641 6468233 
329791 6469354 
325826 5867048 
325829 5867052 
329888 6067149 
329893 6525313 
329899 6563505 
325988 5867064 
325855 5867067 
325999 5867073 
326001 5867073 
325886 5867087 
325682 5867087 
325905 5867104 
325922 6867122 
325937 5867132 

325960 5867147 

325961 5867147 

325838 6552452 

325839 6552452 

325840 6552452 
325844 6552453 
325870 6682492 
329984 4646193 
329976 4878063 
329935 6165200 
329916 6223624 
330021 6671889 
330024 6671908 
330028 6671908 
326033 5867178 
326036 5867178 
326056 5867184 
326116 5867193 
326122 5867194 
326138 5867203 



Plus 36331-36750 

Minus 53197-53269 

Minus 27385-28192 

Plus 75189-75264 

Minus 156551-156649 

Plus 10867-10955 

Plus 192813-193017 

Minus 1035422-103553S 

Minus 1165503-1165810 

Minus 1187981-1188167 

Plus 286823-286991 

PIUS 287578-287663 

Minus 137769-137894 

Minus 240852-240946 

Plus 151873-152057 

Minus 62522-62622 

Minus 64969-65078 

Minus 469726-469860 
Plus -955517-955711 

Plus 156198-156387 

Plus 370618-370763 

Plus 61849-62003 

Plus 16769-16857 

Minus 120278-120559 

Minus 191389-191479 

Pius 118315-118422 

Minus 37647-37730 

Plus 158772-158900 

Minus 22165-22288 

Minus 142207-142359 

Plus 101355-101745 

Plus 131223-131291 

Plus 131351-131495 

Minus 105995-106107 

Minus 131982-132089 

Minus 46361-46458 

Plus 232674-233060 

Minus 37227-37473 

Minus 166123-166791 

Minus 111058-111783 

Plus 17349-17606 

Plus 276141-276251 

Plus 149115-149192 

Plus 155223-155348 

Plus 194694-194915 

Minus 8178-8347 

PIUS 78779-78876 

Minus 329063-329134 
Minus 152633-152902 
Minus 162506-162635 
Minus 165106-165209 
Plus 171451-171532 
Plus 181964-182037 
Plus 184380-184547 
Minus 14188-14332 
Plus 228209-228297 
Minus 139780-139890 
Minus 62584-62691 
Minus 69059-69127 
Plus 36396-37195 
Plus 120938-121032 
Minus 1005-1270 
Minus 30015-30144 
Plus 37261-37333 
Minus 120215-120273 
Minus 181553-181690 
Plus 45548-45604 
Plus 144397-144683 
Minus 179374-179436 
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326145 5867204 
326180 5867211 
326201 5867216 
326207 5867222 
326226 5867230 
326233 5867232 
326238 5867260 
326241 5867260 
326243 5867261 
326251 5867263 
326268 5867267 
326124 5916395 
326339 6056311 
330049 4567182 
326353 5867293 
326365 5867297 
326379 5867327 
326382 5867327 
326390 5867340 
326424 5857369 
326453 5867399 
326472 58S7404 
326492 5867422 
326533 5867441 
330117 6015201 

330115 6015202 

330116 6015202 

330095 6015278 

330096 6015278 
326644 5867559 
326713 5867595 
326745 5867611 

326752 5867615 

326753 5867616 
326598 5867634 
326667 6552455 
326855 6552460 
326812 6682504 
327005 5867664 
327008 5867664 
326896 5867680 
326904 5867684 
326951 6004446 
326941 6004446 
326943 6004446 
326928 6456782 




330158 6580367 
327204 5867447 
327208 5867447 
327266 5867462 
327277 5867473 
327289 5867481 
327296 5867492 
327237 5867544 
327145 5867548 
327333 5902477 
327335 5902477 
327343 6017017 
327350 6249563 
327358 6552411 
327360 6552411 
327409 5867750 
327424 5867751 
327430 5867754 
327470 5867772 



Mlnjs 52599-52814 

Minus 182758-183222 

Minus 166168-166959 

Plus 48139-48219 

Plus 52644-52705 

Plus 124788-124863 

Plus 64282-64338 

Minus 181648-181916 

Plus 123838-123978 

Minus 82716-82822 

Plus 122114-122765 

Plus 407102-407560 

Minus 164637-165251 

Minus 314662-315210 

Plus 9122-9195 

Minus 96630-96764 

Plus 32299-32402 

Minus 50420-50503 

Minus 108814-110592 

Minus 168329-168409 

Plus 86222-86423 

Plus 293739-293940 

Plus 120768-120991 

Minus .532153-532280 

Minus 7340-7680 

Plus 11403-11677 

Plus 12109-12418 

Plus 15343-15814 

Plus 49370-49458 

Plus 42684-42819 

Plus 121511-121798 

Plus 127130-127318 

Minus 1214-1562 

Pius 12454-12511 

Plus 68955-69014 

Plus 142311-142441 

Minus 111390-111463 

Plus 189811-189941 

Plus 610847-610907 

Plus 928737-928811 

Minus 12032-12122 

Minus 9280-9606 

Plus 193812-193998 

Plus 62018-62896 

Minus 89242-89427 

Minus 291007-291219 

Minus 42952-43082 

Minus 43159-43301 

Plus 694486-694998 

PIUS 41925-42083 

Plus 81966-82456 

Plus 165135-165239 

Plus 180805-180864 

Minus 82400-82615 

Minus 165616-165715 

Plus 49296-49536 

Plus 7627-8166 

Minus 59702-59813 

Minus 404B2-40551 

Minus 141448-141609 

Minus 142979-143124 

Minus 12288-12395 

Minus 41890-41985 

Minus 3802-3950 

Minus 6255-6422 

Minus 52949-53011 

Plus 160442-160598 

PIUS 1320-1403 

PIUS 150910-150973 
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327460 6004455 
327498 6017023 

327509 6117815 

327510 6117815 
327512 6117815 
327535 6525279 
330163 6042042 
330171 6848220 
327579 5867824 
327672 5867843 
327629 5867872 
327640 5867890 
327649 5867899 
327612 6525283 
327718 6525284 
327801 5867924 

327762 5867961 

327763 5867981 
327776 5867964 

327822 5867968 

327823 5867968 
327807 5867968 
327845 6531962 
330228 6013527 
330190 6165182 
328122 5868031 
328132 5868038 
328159 5868065 
328168 5868071 
328175 5668073 
328217 5B68096 

327865 5868130 

327866 5868131 
327870 5668131 
327879 5668142 
327902 566815B 
327918 5868165 
327934 5B68184 
327959 5868210 
327976 5868212 
328020 5902482 
328042 5902482 
328008 5902482 
330301 2905862 
330299 2905881 
328274 5868219 
328595 5868224 
328591 5868227 
328668 5868254 
328677 5868256 
328687 5868262 
328706 5868270 
328711 5868271 
328730 5868289 
328732 5868289 
328734 5866289 
328752 5868298 
328755 5868301 
328761 5868302 
328775 5868309 
328784 5868309 
328787 5868309 
328809 5868327 
328B29 5868337 
328280 5868352 
328311 5868371 
328318 5868373 
328323 5868373 
328348 5868383 



Plus 175245-175343 

Minus 42178-42283 

Minus 54882-55053 

Minus 56824-56944 

Plus 176256-176325 

Plus 19105-19175 

Minus 20321-20385 

Plus 110889-111575 

Minus 37229-38335 

Minus 69649-69740 

Plus 49692-49811 

Plus 9448-95B6 

Plus 205871-205927 

Plus 2747-2924 

Plus 86123-86186 

Plus 23239-23348 

Minus 50303-50439 

Plus 229347-229476 

Minus 164308-164486 

Minus 168886-169633 

Minus 170359-170433 

Plus 33745-33811 

Plus 193402-193549 

Minus 3719-3787 

Plus 36103-36243 

Plus 158474-158656 

Minus 126737-126839 

Minus 52957-53162 

Plus 60321-60479 

Plus 208-271 

Minus 3742-4362 

Plus 61503-62205 

Minus 2893-3046 

Plus 53558-53757 

Minus 77722-77793 

Minus 133339-133467 

Plus 547530-547591 

Plus 41830-42036 

Minus 46497-46682 

Minus 349301-349409 

Minus 556386-556652 

Minus 1985085-1986626 

Plus 296663-297151 

Minus 4420-5781 

Minus 1020-1382 

Minus 31244-31439 

PIUS 148738-148967 

Minus 237647-237726 

Minus 10888-10984 

Minus 58708-58950 

Plus 624479-624585 

Plus 165501-165614 

Minus 97797-97990 

Plus 8068-8214 

Plus 37437-37550 

Plus 50559-50747 

Minus 114911-115087 

Minus 145959-146446 

Minus 239308-239412 

Plus 12845-12920 

Minus 74523-74604 

Plus 135772-135963 

Plus 91792-91849 

Plus 36309-36630 
Plus 160563-160631 
Minus 170560-170826 

Plus 414945-415620 
Minus 1080089-1080235 

Minus 260272-260379 
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328377 5868390 

328436 5868417 

328504 5868471 

328506 5868471 

328522 5868477 

328525 5868482 

328541 5868486 

328662 6004473 

328663 6004473 
328803 6004475 
328304 6004478 
328927 5868500 
328936 5868500 
328939 6004481 
328941 6456765 
328948 6456765 
3289S8 6456775 
330316 6007576 

330350 3056622 

330351 3056622 
330348 4544475 
329034 5868561 
329046 5868569 
329053 5868574 
329186 5868711 
329237 5868729 
329276 5868762 
329333 5868806 
329376 5868859 
329384 5B68B69 
329140 6017060 
329317 6381976 
329319 6381976 
329129 6588026 
329373 6682537 
329412 6682553 
329424 6B68879 
329446 5868886 
329449 5868B86 



PIUS 16947-17023 

Plus 203760-203904 

Plus 47064-47217 

Plus 60716-60830 

Plus 1972307-1972452 

Plus 12387-14313 

Plus 130956-131050 

Plus 1184773-1184855 

Plus 1185279-1186634 

Minus 291716-291948 

Minus 3884-3952 

Minus 428829-428893 

Minus 1352202-1352259 

Minus 131139-131320 

Minus 9817-9885 

Plus 28227-28413 

Plus 117442-118283 

Minus 119761-119931 

Minus 26413-26820 

Minus 27522-27614 

Minus 19855-19962 

Minus 32819-32939 

Plus 18971-19030 

Plus 426453-426641 

Minus 13108-13225 

PIUS 133238-133339 

Minus 222629-222709 

Plus 392666-392746 

Plus 52356-52694 

Minus 116524-116662 

Plus 290842-290905 

Plus 614823-615209 

PIUS 721390-721470 

PIUS 14456S-144712 

Minus 38950-39301 

Minus 68948-69041 

Plus 362196-362344 

Plus 84776-84899 

Plus 97697-97771 
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TABLE 14: shows genes, including expression sequence tags, down-regulated in prostate 
tumor tissue compared to normal prostate tissue as analyzed using Affymetrix/Eos Hu02 
GeneChip array. Shown are the ratios of "average" normal prostate to "average" prostate 
5 cancer tissues. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession numbe 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Background subtracted normal prostate : prostate tumor tissue 

Pkey ExAccn UnigenelD Unigene Title 

331328 AA281133 Hs.88808 ESTs 

320875 DB0641 Hs.131921 ESTs 

300994 AI251936 Hs.145298 ESTs 

323461 AA418762 Hs.190044 ESTs 

301015 AA947682 Hs.217173 ESTs; Weakly similar to Chain A; Cdc42hs-Gdp Complex [H.sapiens] 

319419 AA543093 Hs.13648 ESTs; Highly similar to mitogen-induced [M.musculus] 

323486 C05278 Hs.166800 ESTs; Moderately simifar to [PYRUVATE DEHYDROGENASE(UPOAMIDE)] 

KINASE ISOZYME 4 PRECURSOR [H.sapiensl 

324882 AW419080 Hs.250645 ESTs 

330S69 U57796 Hs.57679 zinc finger protein 192 



330769 AA465192 
312614 A1766732 
314790 AW341754 
309979 AW452118 



324307 AA627642 
303685 AW500106 
314921 AW452382 
315840 AA679001 
332776 AA034364 
313533 AW298141 
303494 F30712 
317490 AI627358 
332546 D84454 
334719 

300679 AA813958 
311811 AI625304 
315310 AW511298 
312871 H86747 
324715 AI739168 



316160 AW197887 
313833 AA766825 
315850 AW270550 
303124 AF1 61350 
323346 AL134932 
301383 AA913591 
324513 AW501678 



312381 R42049 
317514 AW451570 
319750 AA621606 



Hs.142230 ESTs 
Hs.188836 ESTs 
Hs.146217 ESTs 
Hs.16514 ESTs 
HS201194 ESTs 
Hs.189305 ESTs 
Hs.257533 EST 
Hs.189023 ESTs 

CH.X_hsgi|5868716 
Hs.4994 transducer of ERBB2; 2 (TOB2) 

EST cluster <not in UniGene) with exon hit 
Hs.257564 ESTs 
Hs.192221 ESTS 

Hs.256551 ESTs; Weakly similar to Hi! ALU CLASS B WARNING ENTRY I!!! [H.sapiens] 
Hs.157975 ESTs 

EST cluster (not in UniGene) with exon hit 
Hs.148367 ESTs 

Hs.21899 solute carrier family 35 (UDP-galactose transporter); member 2 



HS207727 ESTs; Moderately similar to KIAA0071 [H.sapiens] 



ESTs 

KIAA1116 protein 

EST cluster (not in UniGene) 

ESTs 

ESTs 

ESTs 

EST cluster (not in UniGene) 
ESTs 

EST cluster (not in UniGene) with exon hit 

ESTs 

ESTs 

ESTs 

EST cluster (not in UniGene) with exon hit 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
ESTs 



Hs.143607 
Hs.126480 
Hs.164577 
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314754 
316038 
31B473 
5 307848 
300730 
303034 



10 300547 
316100 
314801 
320856 
313188 

15 314187 
311826 



AW449204 

W60843 

AI679131 

AA541323 

N53442 

AW203986 

AA481027 

D59945 

AB39702 



Hs.257125 
Hs.31570 
Hs.201424 
Hs.1 15831 



Hs.1 79573 
Hs.1 18920 
Hs.1 22826 



EST cluster (not in UniGsne) 
4 ESTs 
3 ESTs 
3 ESTs 

EST singleton (not in UniGene) with exon hit 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs; Weakly similar toORF YGR245c [S.csrevisiae] 
EST cluster (not in UniGene) 
collagen; type I; alpha 2 
ESTs 
ESTs 

EST cluster (not in UniGene) with exon hit 
ESTs 

EST cluster (not in UniGene) 



324771 AA631739 EST cluster (not in UniGene) 

323691 AA317561 EST cluster (not in UniGene) 

303525 AW516519 Hs.1 15130 ESTs 

309709 AW242630 EST singleton (not in UniGene) with exon hit 

300038 AFFX control: MurlL4 

316526 AI0881 92 Hs.1 35474 ESTs; Weakly similar to ATP-DEPENDENT RNA HEUCASE A [H.sapiens] 

313029 AA731520 Hs.170504 ESTs 

304356 AA196027 Hs.1 95188 glyceraldehyde-3-phosphate dehydrogenase 

314610 AI948688 Hs.191805 ESTs 

329815 CH.14_p2gi|6624888 

314949 AI745387 Hs.239124 ESTs 

300598 N53574 Hs.1 58932 ESTs 

329218 CH.X_hsgi|5868726 

315706 AW440742 Hs.1 55556 ESTs 

303751 AW503637 EST cluster (not in UniGene) with exon hit 

307783 AI347274 EST singleton (not in UniGene) with exon hit 

321414 AA324975 Hs.1 28993 ESTs; Weakly similar to KIAA0465 protein [H.sapiens] 

312187 AA700439 Hs.188490 ESTs 
334061 CH22_FGENES.327J4 
336036 CH22_FGENES.678_7 

321477 H67818 Hs.222059 ESTs 

316760 AW139383 Hs.245437 ESTs 

316733 AA811713 Hs.183222 ESTs 
Hs.79828 ESTs 
Hs.145704 ESTs 



1 AA304986 
9 AA740616 
4 AA814859 



Hs.1 79100 
7 AA258222 Hs.87757 
1 AI989538 Hs.191074 
6 AA749062 Hs.180285 



EST cluster (not in UniGene) 

ESTs 

ESTs 

ESTs 

ESTs 

EST cluster (not in UniGene) 

EST singleton (not in UniGene) with exon hit 

ESTs 

ESTs 

EST cluster (not in UniGene) 



314301 AW297967 
312335 AW043620 
322957 AA247755 
316848 AA830053 
313473 AA009660 
318518 T27119 
313383 AI076370 
331389 AA458637 
304257 AA053294 
309917 AW340014 
319661 H08035 



Hs.189886 ESTs 
Hs.188181 ESTs 
Hs.236993 ESTs 

EST cluster (not in UniGene) 
Hs.126798 ESTs 

Hs.251948 ESTs; Moderately similar to T07D3.7 [C.elegans] 

EST cluster (not in UniGene) 
Hs.134037 ESTs 
Hs.152207 ESTs 

EST singleton (not in UniGene) with exon hit 

EST singleton (not in UniGene) with exon hit 
Hs.21398 ESTs; Moderately similar to PUTATIVE GLUCOSAMINE-6-PHOSPHATE 
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321253 AI699484 



M11507 
324330 AA884766 
320014 AA137114 



ISOMERASE [H.sapiens] 
EST cluster (not in UniGene) 
ESTs 



AFFX control: transferrin receptor 
EST cluster (not in UniGene) 
1 ESTs 



318146 AI040125 
323348 AA233056 
305703 AA825148 



324034 AA382603 

319761 R84237 

20 317013 AA864468 

317383 M913887 

314659 AW277121 

312479 Alg50844 

25 311824 AW293826 



Hs.150521 E 
Hs.191518 E 
Hs.21229 F 

Hs.127748 
Hs.159560 
Hs.160994 



EST cluster (not in UniGene) 



ESTs 
ESTs 
ESTs 

EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
ESTs 
ESTs 
ESTs 

ESTs; Weakly similar to non-lens beta 
CH22 FGENES.7J0 
ESTs 



gamma-crystallin like protein [Rsapiens] 



316074 AW517542 Hs.208382 ESTs 



312071 AA583529 

312684 AW294020 

332668 AA062971 

322139 H53744 

304168 H77679 
325602 

319885 H59096 

300611 N75450 

316854 AAB31215 

318208 AID91458 

331623 R38715 

324616 AI823999 



300767 AW193466 
313463 AI0573S9 
320600 AA135565 
301180 AI308989 
324825 AA704457 
300336 AW292417 



321142 AI817933 

319478 R06841 

300793 AI248571 

313733 AA836116 

326505 

314987 AW015506 

303114 AF090948 

3187D9 H24244 



Hs.143119 
Hs.117721 
Hs.181161 



65 312887 AW157377 
315183 AW136134 
300259 AI479011 
313240 AI743261 
316697 AW293174 



Hs. 177230 
Hs.132910 
Hs.220277 
Hs.170783 
Hs.131860 
Hs.252627 



EST singleton (not in UniGene) with exon hit 
ESTs 
ESTs 

ESTs; Weakly similar to INHIBITOR OF APOPTOSIS PROTEIN 1 [M.musculus] 

EST cluster (not in UniGene) 

EST singleton (not in UniGene) with exon hit 

CH.13_hsgi|5866994 

ESTs 

EST cluster (not in UniGene) with exon hit 

ESTs; Weakly similar to predicted using Genefinder [Celegans] 

ESTs 

Homo sapiens clone 24540 mRNA sequence 
ESTs 

EST singleton (not in UniGene) with exon hit 
ESTs 



ESTs 
ESTs 

ESTs; Moderately similar to gag [H.sapiens] 

ESTs; Moderately similar to high-risk human papilloma viruses E6 

oncoproteins targeted protein E6TP1 alpha [H.sapiens] 

EST cluster (not in UniGene) 

CH22_DA59H18.GENSCAN.28-7 

EST cluster (not in UniGene) 

ESTs 

EST cluster (not in UniGene) 
ESTs 

EST cluster (not in UniGene) 

CH.19_hsgi|5867435 

ESTs 

EST cluster (not in UniGene) with exon hit 
ESTs; Weakly similar to /prediction 
ESTs 

CHX.hsgi|5868728 
CH.06_hsgi| 



ESTs 
ESTs 
ESTs 
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302697 AJ001408 



316240 AA974253 
323208 AA203415 
321643 W76005 
330723 AA243617 
323455 AA256675 
308383 AI624497 
328744 

332344 W45574 
328121 

321915 AI670955 



300664 AI444628 
323362 AL135067 
25 300024 M10098 
325026 AI671168 



30 



313389 
301309 
313570 
316504 
319401 
312827 
327871 
35 337173 



AI765132 

M78276 

AA041455 

AW1 35854 

R01342 

AI744361 



3 AA465635 
3 AL1 18754 
7 AI791138 
9 AA830515 



322114 
313671 
303211 



AI701559 
N36417 
AA643791 



Hs.189061 ESTs 

ze31a12.s1 Soares retina N2b4HR Homo sapiens cDNA clone 

IMAGE:36574 3', mRNA sequence . 
Hs.1 60870 ESTs 

Hs.248572 Human PAC clone DJ404F18 from Xq23 

EST cluster (not in UniGene) with exon hit 

EST singleton (not in UniGene) with exon hit 

EST cluster (not In UniGene) 
Hs.120319 ESTs 
Hs.1 36200 ESTs 
Hs.32094 ESTs 

Hs.31082 ESTs; Highly similar to db83 [R.norvegicus] 

Hs.200438 ESTs; Weakly similar to atypical PKC specific binding protein [Rnorvegicus] 

EST singleton (not in UniGene) with exon hit 

CH.07_hs gi|5868290 
Hs.252497 ESTs 

CH.06_hs gi|5868031 
Hs.200151 ESTs 
Hs.1 87726 ESTs 

Hs.1 73933 ESTs; Weakly similar to NUCLEAR FACTOR 1/X [H.sapiens] 

CH.Y_hsgi|5868887 

CH22_FGENES.420_4 
Hs.256809 ESTs 
Hs.1 17182 ESTs 
AFFX control: 18S ribosomal RNA 
Hs.1 2285 ESTs 
Hs.120849 ESTs 
Hs.1 19903 ESTs 
Hs.255917 ESTs 
Hs.209312 ESTs 
Hs.1 32458 ESTs 

EST cluster (not in UniGene) - 
Hs.205591 ESTs; Weakly similar to zinc finger protein Png-1 [M.muscutus] 

CH.06_hs gi|5869131 



Hs.1 16768 
Hs.222917 
Hs.44439 
Hs.32844 

Hs.144928 
Hs.1 91 740 
Hs.1 45553 
Hs.191436 



50 318587 AA779704 

312378 R41582 

318625 T48446 

305181 AA663726 

300815 AA286678 

55 324063 AW292740 

315859 AA682305 

305092 AA642912 

306598 AI000320 

300307 AI651016 

60 321348 Z49979 

325112 AI90377O 
336679 

321383 AJ002574 
337357 

65 300680 AW468066 
327120 

302761 AW250553 

312132 AI475490 

315639 AA827652 



Hs.1 16922 EST 



EST cluster (not in UniGene) with exon hit 

EST cluster (not in UniGene) 

ESTs 

ESTs 

STAT induced STAT inhibitor-4 
ESTs 

EST singleton (not in UniGene) with exon hit 

ESTs 

ESTs 

ESTs 

ESTs; Highly similar to dJ11 18D24.4 [H.sapiens] 
EST cluster (not in UniGene) with exon hit 
CH22_EM:AC005500.GENSCAN212-3 
EST cluster (not in UniGene) 
ESTs 

retinal degeneration B beta 
ESTs 



Hs.246311 
Hs.124344 E 



EST cluster (not in UniGene) with exon hit 
5 ESTs 
3 ESTs 

EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 



EST cluster (not in UniGene) 



Hs.257712 ESTs; Weakly similar to K1AA0986 protein [H.sapiens] 

CH.21_hsgi|6531970 

EST cluster (not In UniGene) with exon hit 
Hs.170577 ESTs 

EST cluster (not in UniGene) 
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318327 AW294013 
324980 AA969121 
319429 A1608881 
310601 AI970543 
318905 Z43395 
323442 AA252753 
304428 AA342250 
313352 AW292127 
316491 AA766025 
317751 AI697668 
314136 AA229781 
306665 AI004614 
303946 AW474196 
313435 AA769123 
317679 AA968799 
322370 AA330095 
306620 AI000929 
329109 

311043 AI8712D9 



Hs.1 87435 ESTs 

EST singleton (not in UniGene) with exon hit 

CH.21_hsgi|6531965 
Hs.1 92007 ESTs 
Hs.33862 ESTs 
Hs.220750 ESTs 
Hs.201379 ESTs 

CH22_FGENES.516_11 
Hs.91662 KIM0888 protein 



309023 AI888045 

310748 AI493675 

316769 AI914939 

320409 AA356195 



Hs.254296 ESTs 

Hs.11482 ESTs; Highly similar to junctional adhesion molecule [H.sapiens] 
Hs.1 92605 ESTs 

EST cluster (not in UniGene) 
Hs.1 64039 ESTs 

Hs.99819 ubiquitin specific protease 16 

Hs.144758 ESTs 

Hs.238794 EST 

Hs.202241 ESTs 

Hs.221962 ESTs 

Hs.1 30577 EST 

Hs.221604 ESTs 

EST cluster (not in UniGene) 
Hs.150289 ESTs 

EST cluster (not in UniGene) 

EST singleton (not in UniGene) with exon hit 

CH.XJisgi|5868626 
Hs.1 77128 ESTs 

Hs.1 58748 ESTs; Weakly similar to synapsin lb [M.musculus] 
Hs.1 84776 ribosomal protein L23a 

EST singleton (not in UniGene) with exon hit 



303889 T19204 
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HS512184 ESTs 2.22 

EST cluster (not in UniGene) 2.21 

CH22_FGENES.87_8 2.21 

Hs.137487 ESTs 2.21 

Hs.145068 ESTs 2.2 

Hs.169436 arginyltransferase 1 22 

Hs.172787 ESTs 22 

EST cluster (not in UniGene) 22 

CH.08_hsgi|5868514 2.2 

EST cluster (not in UniGene) with exon hit 25 

325045 T08845 EST cluster (not in UniGene) 2.2 

312828 AI865455 Hs.211818 ESTs; Moderately similar to !!!! ALU SUBFAMILY J WARNING ENTRY !!!! [H.sapiens] 2.19 

335109 CH22_FGENES.494_15 2.18 

330878 AA131471 Hs.71440 ESTs 2.18 

311289 AI971362 Hs.231945 ESTs 2.18 

304608 AA513456 EST singleton (not in UniGene) with exon hit 2.18 

337393 CH22_FGENES.747-4 2.18 

332812 CH22_FGENES.7_14 2.18 

327665 CH.04_hsgi|5867839 2.18 

314581 AW504859 Hs.237649 ESTs 2.17 

326508 CH.19_hsgi|6682496 2.17 

301242 AW161535 Hs.258803 ESTs 2.17 

312780 AI765651 Hs.172900 ESTs 2.17 
315954 AW276810 
311179 AI880843 
315320 AI084182 
313017 AI015203 
312430 AW139117 
300864 AA406539 
314753 AA463262 
322574 AF156548 
321409 C03864 
321205 AA002047 
320406 AA353895 



Hs.223333 ESTs 

Hs.1 86895 ESTs 

Hs.1 18015 ESTs 

Hs.1 17494 ESTs 

Hs.1 90958 ESTs 

EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
EST cluster (not in UniGene), 

Hs.1 52983 HUS1 (S. pombe) checkpoint homolog 
CH22_EM:AC000097.GENSCAN.1 1-2 
EST cluster (not in UniGene) with exon hi 

Hs.1 86564 ESTs 
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305813 AI066544 
314465 AAB02917 
318168 AIB21782 
315990 AIB00041 
320712 R66867 
318487 AI167877 
317462 AWO15206 
304384 AA235482 
314544 AA399018 
319881 T72744 
328078 

317354 AW090770 
308617 AI738720 
311568 AW439969 
313605 AI761786 
314289 AA848118 



313659 AW296067 

324596 AW149321 

324783 AA640770 

302696 AA347452 

313418 AW450674 



331481 N27448 
335288 

307513 AI274307 

323316 AL134620 

319479 R21945 



314569 AA813784 
332783 W45302 
315259 AA701499 



Hs.143716 
Hs.178784 
Hs.62954 



Hs.218177 
Hs.204674 
Hs.221216 



327574 



305235 M670480 
316055 AA693880 
317194 AW445167 
319565 AW408683 
335146 

301475 AI678183 
312442 AA120970 
322502 R62925 
303693 AA290875 
310179 AI215643 
321121 W23285 
331330 AA282197 
306557 AA994530 
317865 AI298794 
318667 AI493742 
318042 AW294522 



Hs.1 34754 
Hs.103853 
Hs.232150 

Hs.201274 



EST singleton (not in UniGene) with exon hit 
ESTs 

ESTs; Moderately similar to I!!! ALU SUBFAMILY SC WARNING ENTRY !!!! 
ESTs 

EST cluster (not in UniGene) 

ESTs 

ESTs 

ferritin; heavy polypeptide 1 
ESTs 

EST cluster (not in UniGene) 

CH.06_hsgi|5868008 

ESTs 

EST singleton (not in UniGene) with exon hit 
ESTs 



CH22_FGENES.38_7 

CH.12_hsgi|5866967 

ESTs 

ESTs 

EST cluster (not in UniGene) 

EST cluster (not in UniGene) with exon hit 

ESTs 

CH.21_hsgi|6456782 

CH.03_hsgi|5867818 

ESTs 

ESTs 

EST singleton (not in UniGene) with exon hit 

EST cluster (not in UniGene) 

ESTs 

ESTs 

CH22_FGENES.499_2 

prostaglandin E receptor 3 (subtype EP3) 

ESTs 

ESTs 

ESTs 

ESTs 

EST cluster (not in UniGene) 

ESTs; Highly similar to CGI-07 protein [H.sapiens] 

EST singleton (not in UniGene) with exon hit 

ESTs 

ESTs 

ESTs 



ESTs 

CH22_FGENES.581_41 
ESTs 

EST cluster (not in UniGene) 



CH22 EM:AC005500.GENSCAN.148-16 

CH22_FGENES.301_6 

CH.06_hsgi|5868105 



Hs.1 95188 
Hs.227049 



Hs.123001 
Hs.87889 
Hs.148115 



ESTs 

CH.02_hsgi|6004459 
ESTs 

glyceraldehyde-3-phosphate dehydrogenase 

CH22_FGENES.439-19 

ESTs 

EST singleton (not in UniGene) with exon hit 

CH.07_hsgi|6004473 

ESTs 

helicase-moi 
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312110 AI962180 
335864 

320389 W00545 
314065 AA868267 
323086 H15474 
323919 AA862973 
310750 AI373163 
309435 AW090537 
300129 AW028820 



Hs.171785 ESTs 
Hs.85524 ESTs 

Hs.12214 Homo sapiens clone 23716 mRNA sequence 1.96 

Hs.220704 ESTs 1-96 

Hs.170333 ESTs 1-96 

EST singleton (not in UniQene) with exon hit 1 .96 

EST cluster (not in UniGene) with exon hit 1 .96 

320130 AI820675 Hs.203804 ESTs 1-9E 
323787 AW373446 Hs.1 69885 ESTs; Weakly similar to cDNA EST EMBL:T0221 6 comes from this gene [C.elegans] 1 

338112 CH22_EM:AC005500.GENSCAN.185-24 1.95 

313625 AW468402 Hs.254020 ESTs 1-95 

325240 CH.10_hsgi|5866848 1.95 

331833 AA412102 Hs.250911 interteukin 13 receptor: alpha 1 1.95 
332252 N63882 za21 f9.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 

IMAGE:293225 3', mRNA sequence 1.& 

300279 AW237425 Hs.253817 ESTs 1-9£ 

326023 CH.17 hsgi|5867245 1.SE 

321609 H86021 Hs.1 98800 ESTs; Weakly similar to hMmTRAlb [H.sapiens] 1.94 

324183 AA402453 Hs.113011 ESTs 1-94 
336276 CH22_FGENES.762_5 
334913 " '"" " 
325417 

318489 AW043590 
318455 AI148763 



324279 AA5Q1412 

309577 AW168753 
327014 

303488 AW025860 



HS225023 ESTs 

EST cluster (not in UniGene) 

EST singleton (not in UniGene) with exon hit 
Hs.257631 ESTs 



EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 
Hs.42788 ESTs 

Hs.191688 ESTs; Weakly similar to Pro-Pol-dUTPase polyprotein [M.musculus] 

EST singleton (not in UniGene) with exon hit 

CH.21_hsgl|5867664 

EST cluster (not in UniGene) with exon hit 
Hs.129559 EST 

Hs.108447 spinocerebellar ataxia 7 (olivopontocerebellar atrophy with retinal degeneration) 



318674 AA295490 
301267 AW297762 
332092 AA608787 



310094 AW450967 
316218 AW207642 
324774 AI031771 
326507 

314570 AA405696 
336268 

315278 AI985544 
325824 

316277 AA737780 
323181 AM16583 
301438 AA961643 
307050 AI147341 



CH.05_hsgi|5867949 

EST cluster (not in UniGene) 
Hs.255690 ESTs 
Hs.1 12590 ESTs 

EST cluster (not in UniGene) 

EST cluster (not in UniGene) 
Hs.209128 ESTs 
Hs.185861 ESTs 

EST cluster (not in UniGene) 
Hs.166674 ESTs 
Hs.145053 ESTs 

Hs.156110 Immunoglobulin kappa variable 1D-8 

Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (from clone DKFZp727C191) 

" - .1 



Hs.235240 ESTs 
Hs.1 74021 ESTs 
Hs.1 32586 ESTs 

CH.19jlsgi|5867435 

EST cluster (not in UniGene) 

CH22_FGENES.758_2 
Hs.1 16429 ESTs 

CH.15jsgi|5867048 
Hs.213392 ESTs 
Hs.143621 ESTs 
Hs.127716 ESTs 
Hs.146734 EST 

EST singleton (not in UniGene) with exon hit 
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320127 H72615 
337736 

331319 AA262755 

5 310767 AI377505 

314880 AI732169 

312539 AI004377 

309674 AW205604 

314621 AI627478 

10 319495 AI972146 

313472 AA007374 

302705 U09060 
329511 

317140 AI699412 

15 302598 AI815985 

301153 AA725670 

332222 N28271 

330703 AA055475 

20 318470 AI159863 

314014 AW291847 

300370 AI827817 

312329 R84768 
325587 

25 310237 AI8B4313 

318872 R13085 

303431 AA317915 



321279 H85330 

301690 F05865 

307932 AJ230822 

318292 AI679966 

310254 AI239811 

311790 AW016437 

314248 AA278347 



307954 AI419692 
40 302549 AF055136 
321629 H87213 



321533 W78877 
312946 AI915122 
60 319474 H90265 



t OKFZP547GQ910 protein 1.89 

Hs.17268 ESTs 1.89 

CH22_EM:AC000097.GENSCAN.100-2 1.89 

Hs.194264 ESTs 1.88 

Hs.158835 ESTs 1.88 

Hs.105429 ESTs 1.88 

Hs.200360 ESTs 1.88 
Hs.168034 ESTs; Weakly similar to !!!! ALU SUBFAMILY SP WARNING ENTRY !!!! [H.sapiens] 1.8E 



Hs.1 92756 ESTs 



Hs.201925 
Hs.1 29683 
Hs.120485 

Hs.1 76318 
Hs.104143 
Hs.143713 
Hs.121715 



cluster (not in UniGene) 
EST cluster (not in UniGene) with exon hit 
CH.10_p2gi|3983514 
ESTs 

ubiquitin-conjugating enzyme E2D 1 (homologous to yeast UBC4/5) 
ESTs; Weakly similar to serine/threonine kinase with SH3 domain; lew 
zipper domain and proline rich domain [Rsapiens] 
ESTs 

clathrin; light polypeptide (Lea) 
ESTs 

ESTs; Weakly similar to HP protein [H.sapiens] 
EST cluster (not in UniGene) with exon hit 
Homo sapiens clone 25032 mRNA sequence 
CH.12_hsgi|6682462 



EST cluster (not in UniGene) 

EST cluster (not in UniGene) with exon hit 

CH22_EM:AC005500.GENSCAN.349-1 
Hs.191098 ESTs 
Hs.146060 ESTs 

Hs.249180 ubiquitin-conjugating enzyme E2E2 (homologous to yeast UBC4/5) 

EST singleton (not in UniGene) with exon hit 
Hs.150603 ESTs 
Hs.157491 ESTs 
Hs.233462 ESTs 
Hs.126078 ESTs 



324231 W60827 



324661 AW504161 
313190 AA766707 
301979 L28168 
302099 AL021397 
320187 T99949 
320791 R78808 



CH22_FF113D11.GENSCAN.6-4 
EST singleton (not in UniGene) with exon hit 
2 tectorin alpha 

2 ESTs 

EST cluster (not in UniGene) with exon hit 
transcription factor 4 
CH.01_hsgi|5867445 

3 ESTs 

9 ESTs; Weakly similar to E-SELECTIN PRECURSOR [H.sapiens] 
EST cluster (not In UniGene) 



Hs.180920 
Hs.40111 
Hs.204087 
Hs.100636 



CH.07_hsgi|5868316 
EST cluster (not in UniGene) 
ESTs 

potassium voltage-gated channel; Isk-related family; member 1 
ribosomal protein L34 pseudogene 1 
EST cluster (not in UniGene) 

ESTs; Weakly similar to !!!! ALU CLASS A WARNING ENTRY !!!! [H.sapiens] 
CD74 antigen (invariant polypept of MHC; class II antigen-associated) 
ribosomal protein S9 
ESTs 

ESTs; Weakly similar to F33D1 1.9b [C.elegans] 
ESTs 

CH.10_p2gi|3983510 
EST cluster (not in UniGene) 
ESTs 

CH.X_hsgi|5868732 

ESTs 

ESTs 

CH.16_hsgi|5867076 
EST cluster (not in UniGene) 
..... „ ^ 
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300208 AI341180 

319635 R17531 

313454 AA730673 

303093 AI40Q31D 



319845 AA649011 

300290 AI623739 

312180 AI248285 

313058 D81015 



Hs.211417 
Hs.204877 
Hs.244624 
Hs.196115 



ESTs 
ESTs 

CH.05_p2gi|5671884 
ESTs 

CH.10_p2gi|3983507 
ESTs 



ESTs; Weakly similar to FIBRILLIN 1 PRECURSOR [H.sapiens] 
EST cluster (not in UniGene) 



Hs.187902 E 
Hs.186387 E 
Hs.118348 E 



8 ESTs 

EST singleton (not in UniGene) with exon hit 
CH.19 hsgi|5867435 



302345 NMJXX3565 
308100 AI475949 
311386 AW205705 



312486 AA845630 
325450 

321206 H54178 

330977 H20826 

303487 AA333666 

310398 AI264671 

313230 AI540166 

317747 AI683782 

303381 AL038841 



ESTs 
CH.19_p2gi|6671864 
CH.07_hsgi[5868405 
EST cluster (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 
ESTs 

CH.05J32 #671910 

ESTs 

ESTs 

CH.12_hsgi|5866941 

ESTs 

ESTs 

EST cluster (not in UniGene) with exon hit 



Hs.163313 

Hs.208484 
Hs.1 19824 



316002 AW451733 
319850 AA001811 
329941 

328329 CH.07_hsgl|E858375 
322934 AI493054 Hs.158968 ESTs 
325902 CH.16_hsgi|5867101 
322239 W01813 Hs.12109 WD40 protein Ciaol 

— " - — ESTs 

ESTs 
ESTs 



ESTs; Weakly similar to !!!! ALU SUBFAMILY SB WARNING ENTRY !!!! (H.sapiens] 1 .77 
CH22_FGENES.701_8 1-77 
ESTs 1-77 
ESTs 1-77 
ESTs 1-77 
CH. 



331909 AA43730O 
321553 H92449 
301618 T52760 
319592 AA627356 
318511 T26528 
327183 

313516 AA029058 
318644 AI752482 
321632 AA419617 
324657 AW451142 
300437 AW449374 
319775 AA504429 
314775 AI149880 
337460 

309849 AW297444 
301471 AA995014 
312739 AI318426 
319995 H15355 



Hs.129544 
Hs.1 55925 
Hs.60887 



EST cluster (not in UniGene) with exon hit 1 .76 

5 ESTs 1-76 
5 ESTs; Weakly similar to !!!! ALU SUBFAMILY SO. WARNING ENTRY !!!! [H.sapiens] 1 .76 

CH.01_hsgi|5867442 1.76 

5 ESTs 1-76 

EST cluster (not in UniGene) - 1.76 

EST cluster (not in UniGene) 1-76 

ESTs 1.76 

ESTs 1-75 

methyl-CpG binding domain protein 1 1 .75 

ESTs 1-75 

CH22_FGENES.780-5 1.75 

EST singleton (not in UniGene) with exon hit 1 .75 

ESTs; Weakly similar to ORF YLL027w [S.cerevisiae] 1.75 

ESTs 1-75 

ESTs 1-75 

CH.19_hsgi|5867423 1.75 

CH22_FGENES.801-4 1-75 

ESTs 1-75 

ESTs 1.75 

CR21_hsgi|6456782 1-75 

EST cluster (not in UniGene) 1.75 
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324B26 AA704806 

311269 AI656924 

309375 AW075342 

314171 AI821895 



312195 AI300101 
315707 AI41S055 
324349 AW501470 



303714 AW501336 

318704 Z24981 

303027 AF111178 

322601 W92924 

319382 H93199 



324044 AL045752 

320630 AA199847 
327288 

314986 AI201367 

319078 H17255 
326278 

302552 H49792 

322322 AF086431 
327075 



30 317392 AI797588 
300810 AI076890 
315978 AA830893 
323903 AA773580 
330803 AA004699 

35 309845 AW296802 
314963 AI689617 
311710 F09774 
315315 AI984592 



300724 AI762929 Hs.206134 



Hs.33665 
Hs.220540 



EST singleton (not in UniQene) with exon hit 

ESTs 

ESTs 

CH22_FGENES.380_1 

ESTs 

ESTs 

EST cluster (not in UniGene) 

ESTs; Weakly similar to similar to reverse transcriptase [Celegans] 

EST singleton (not in UniGene) with exon hit 

EST cluster (not in UniGene) with exon hit 

EST cluster (not in UniGene) 

EST cluster (not in UniGene) with exon hit 

EST cluster (not in UniGene) 

ESTs 

EST cluster (not In UniGene) 

ESTs; Highly similar to ARYL HYDROCARBON RECEPTOR NUCLEAR 
TRANSLOCATOR [H^apiens] 

Homo sapiens mRNA; cDNA DKFZp434N174 (tan clone DKFZp434N174) 
ESTs 

EST duster (not in UniGene) 

CH.01_hsgl|5867481 

ESTs 

ESTs 

CH.17_hs gi|5867269 

EST cluster (not in UniGene) with exon hit 

EST cluster (not in UniGene) 

CH.21_hsgi[6531965 

ESTs 

ESTs 

ESTs 



Hs.150580 putative translation initiation factor 



Hs.145459 
Hs.1 86949 
Hs.119769 



ESTs; Weakly similar to K1 1C4.2 [Celegans] 
EST cluster (not in UniGene) 



334379 

305854 ; 

313031 N34927 
329728 

312090 N57692 

323341 AL134875 

302077 AA310580 

310766 AI971438 

311450 AI809985 

311792 AW238064 

321500 H71999 

311948 T78791 

302270 R56151 
329089 

322331 AF086467 

318235 AI080361 

304561 AA489792 

312681 AI028149 

310250 AI478629 



321225 AL080073 

322289 AA534550 

319802 AI701489 

314022 AW452420 

314937 AA515602 



HS.11B064 
Hs.192386 
Hs.132898 



1.71 
1.71 



ESTs 

Homo sapiens chromosome 11; BAC CIT-HSP-31 1e8 (BC269730) 

containing the hFEN1 gene 

ESTs 



ESTs 

EST duster (not in UniGene) 1 .71 
ESTs; Moderately smlr to III! ALU SUBFAMILY SQ WARNING ENTRY !!!! [H.sapiens] 1 .71 

EST duster (not in UniGene) with exon hit 1 .71 

CH.X_hsgi|5868614 1.71 

EST duster (not in UniGene) 1.71 

ESTs 1.71 

EST singleton (not in UniGene) with exon hit 1 .71 

pyruvate dehydrogenase kinase; isoenzyme 3 1 .71 

ESTs 1.71 

CH22_EM:AC005500.GENSCAN.219-6 1.71 

CH22_DJ32l10.GENSCAN.11-2 1.71 

Homo sapiens mRNA; cDNA DKFZp564B1462 (from done DKFZp564B1462) 1.7 

ribosomal protein S29 1.7 

ESTs 1.7 

ESTs 1.7 
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300580 AA761322 

304398 AA262785 

313421 AW339515 

309763 AW270182 



EST singleton (not in UniGane) with exon hit 
Hs.163700 ESTs 

EST singleton (not in UniGane) with exon hit 
EST cluster (not in UniGene) 
Hs.121158 ESTs 

EST cluster (not in UniGene) 
CH.01_hsgi|5866841 
Hs.161904 ESTs 
Hs.146406 nitrilase 1 
Hs.43616 ESTs 



315603 AA764768 
325031 T08597 
327157 

314809 AI741461 
320361 H67220 
324721 AW402302 
328624 

303344 AA255977 Hs.250646 ESTs; Highly similar to ubiquitin-conjugating enzyme [M.musoulus] 
CH.08_hsgi|6456775 

5 ESTs 

6 lipophilin B (uteroglobin family member); prostatein-like 
EST cluster (not in UniGene) 
EST singleton (not in UniGene) with exon hit 



2 AA657501 

5 AJ224172 
9 R14537 

6 AW137700 

7 D84424 

6 AA876905 
8 

3 AA354146 
3 AL079289 

7 AI927068 
6 AI472124 
2 AI273815 



Hs.57697 hyaluronan synthase 1 



Hs.137154 
Hs.1 10853 
Hs.157757 
Hs.242463 



ESTs 
CH.07_hsgi|5868485 
EST cluster (not in UniGene) 

Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 35971 

ESTs; Weakly similar to R10D12.12 [Celegans] 

ESTs 



CH22_EM:AC005500.GENSCAN.390-10 

2 AA195405 Hs.1 10347 Homo sapiens mRNA for alpha integrin binding protein 80; partial 

1 R05385 EST cluster (not in UniGene) with exon hit 

3 Z42977 Hs.21062 ESTs 

2 AW244073 Hs.145946 ESTs 
8 AW1 37772 Hs.1 85980 ESTs 
0 CH.14_hS #381953 

321739 AL080280 EST cluster (not in UniGene) 

319808 T58960 EST cluster (not in UniGene) 

313443 AA249037 EST cluster (not in UniGene) 

ESTs 
ESTs 

EST cluster (not in UniGene) 
CH.08_p2gi[5932415 
CHXJisgi|5868602 
CH22_FGENES.318_3 



3 AA424754 
3 AI797592 
3 M081820 



317791 AI801500 

322235 AF086106 

331148 R73816 
325452 

315106 AW452184 
326014 

307130 AI185234 

300943 AA524545 

319402 W21298 

310889 AI457946 

323371 AL135118 



Hs.1 28457 
Hs.17385 
Hs.232100 



EST cluster (not in UniGane) 
ESTs 

CH.12_hsgi|5866941 
ESTs 

CH.16_hsgi|5867160 

EST singleton (not in UniGene) with exon hit 
ESTs 

EST cluster (not in UniGene) 

ESTs; Weakly similar to hyperpolarization-activated; cy 

nucleotide-gated channel 2 [Ksapiens] 

EST cluster (not in UniGene) 

CH22_FGENES.581_4 

ESTs 

CH22_DA59H18.GENSCAN.3-1 



315343 AW205477 
334487 

312169 AI064824 

309668 AW204480 



CH. 
ESTs 



307965 AI421641 
316787 AW369770 
300835 AA401858 



313231 AW139993 



Hs.118112 
Hs.179891 



Hs.193385 ESTs 
Hs.253414 EST 
Hs.248895 EST 

EST singleton (not in UniGene) with exon hit 
Hs.130351 ESTs 
Hs.224843 ESTs 

CH22_EM-.AC005500.GENSCAN517-16 
Hs.154302 ESTs 
Hs.163682 ESTs 
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319901 T77136 
326530 

301126 AI802877 
314043 AA827082 
304387 AA236027 
322932 AA099732 
337272 



318038 AW295409 



301918 AA476777 

315664 AI744068 

304405 AA2B2572 

310624 AI341594 

319250 F11623 

310608 AI962234 

317348 AI348076 



304238 W93278 
317683 AI791700 
311960 AW440133 
312834 AI028309 
325326 

313663 AI953261 
327526 

300429 AW449679 
305169 AA663131 
316621 AI021996 
329866 

318035 AI744130 
300492 AL031709 
316532 AI307229 
332048 AA493019 
307113 AI183686 
319127 N49476 
331155 R87650 
338220 

315763 AW515270 
323571 AA984133 
312240 R28628 
304569 AA490934 
313179 AI076101 
326858 

317276 AI823847 
312572 AA350125 
311932 AW451654 
3D2103 AA452310 
308413 AI636253 
310077 AI620617 
337780 
327796 

308352 AI610791 
324539 AI378032 
303232 AA437414 



CH22_FGENES.327J8 1.65 

Hs.8765 RNA halicase-related protein 1-65 

CH.19 hsgi|5867441 1-65 

Hs.210843 ESTs; WeaWy similar to (JJ1039K5.2 [Rsapiens] 1.65 

EST cluster (not in UniGene) 1 -65 

EST singleton (not in UniGene) with exon hit 1 .65 

EST cluster (not in UniGene) 1-65 

CH22_FGENES.660-1 1-64 

Hs.243901 KIM1067 protein 1-64 

EST cluster (not in UniGene) 1-64 

Hs.256112 ESTs 1-64 

Hs.122858 ESTs 1-64 

Hs.1 37945 ESTs 1-64 

Hs.1 59087 ESTs 1-64 
Hs.1 30720 ESTs; Weakly similar to CELLULAR NUCLEIC ACID BINDING PROTEIN [Rsapiens] 1 .64 

EST singleton (not in UniGene) with exon hit 1 -63 

EST cluster (not in UniGene) 1-63 

Hs.1 45958 ESTs 1.63 

Hs.1 87032 ESTs 1-63 

CH22 FGENES.301_8 1-63 

CH.02Jisgi|5867772 1-63 

EST cluster (not in UniGene) with exon hit 1 .63 

Hs.1 6071 2 ESTs 1-63 

. EST singleton (not in UniGene) with exon hit 1.63 

Hs.157522 ESTs; Moderately similar to env protein [Ksapiens] 1.63 

EST cluster (not in UniGene) 1 -63 

Hs.196102 ESTs 1-63 
Hs.831 3-hydroxymethyl-3-methylglutary|.Coenzyme A lyase (hydroxymethylglutaricaciduria) 1.63 

EST singleton (not in UniGene) with exon hit 1 -63 

Hs.188536 Homo sapiens clone 24838 mRNA sequence 1.63 

Hs.250852 ESTs; Highly similar to ubkjuitin hydrolyzing enzyme I [H.sapiens] 1 .63 

CH.07_hsgl|5868383 1-63 

EST singleton (not in UniGene) with exon hit 1 .63 

Hs.127893 ESTs 1-63 

Hs.189690 ESTs 1-62 

Hs.1 14246 ESTs 1-62 

CH.11Jlsgi|5866875 1.62 

Hs.169813 ESTs 1.62 

CH.02_hsgi|6381882 1-62 

' Hs.158739 ESTs; Highly similar to XG GLYCOPROTEIN PRECURSOR [Rsapiens] 1.62 

EST singleton (not in UniGene) with exon hit 1.62 

Hs.122138 ESTs 1-62 

CR14_p2gi[6272129 1.62 

Hs.131201 ESTs 1-62 

multiple UniGene matches 1 -62 

Hs.184304 ESTs 1-62 

Hs.201591 ESTs 1-62 

EST singleton (not in UniGene) with exon hit 1 -62 

EST cluster (not in UniGene) 1.62 
Hs.33439 ESTs; Weakly similarto !!!! ALU SUBFAMILY J WARNING ENTRY!!!! [Rsapiens] 1.61 

CH22_EM:AC005500.GENSCAN.246-9 1.61 

Hs.1 18342 ESTs 1.61 

Hs.153260 c-Cbl-interacting protein " 1.81 

Hs.203669 ESTs 1-61 

EST singleton (not in UniGene) with exon hit 1 .61 

Hs.131704 ESTs 1-61 

CH.20_hsgi|6552462 1.61 

Hs.129986 ESTs 1.81 

Hs.187499 ESTs 1-81 

Hs.257482 ESTs 1-61 

Hs.26090 ESTs; Weakly similar to T20B 12.1 [C.eiegans] 1.61 

Hs.196511 EST 1-81 

Hs.1 48565 ESTs 1-61 

CH22_EM:AC000097.GENSCAN.121-2 1.61 

CH.05jlsgi|5867982 1-61 

EST singleton (not in UniGene) with exon hit 1 .61 

HS.125B92 ESTs 1-61 

EST cluster (not in UniGene) with exon hit 1.61 
CH22_EM:AC005500.GENSCAN.54-2 1.61 
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303620 AA397546 Hs.1 19151 ESTs 1.61 

303481 AA336339 EST cluster (not in UniGene) with exon hit 1.61 

314481 AA548589 Hs,105846 ESTs 1.61 

300327 AI908894 Hs.245893 ESTs 1.6 

5 323473 AA262442 EST cluster (not in UniGene) 1.6 

326154 CH.17_hsgi|5867170 1.6 

331920 AA446885 Hs.99087 - ESTs; Moderately similar to ZINC FINGER PROTEIN 1 41 [H.sapiens] 1 .6 

323827 AW406878 EST cluster (not in UniGene) 1.6 

322452 W56710 EST cluster (not in UniGene) 1.6 

10 310597 AI739Q71 Hs.158515 ESTs 1.6 

307871 AI338665 EST singleton (not in UniGene) with exon hit 1.6 

322215 AF088005 EST cluster (not in UniGene) 1.6 

318420 AI139857 Hs.143837 ESTs 1.6 

332217 H98987 Hs.1 02383 EST 1.6 

15 324937 M79230 Hs.192398 ESTs 1.6 

320543 AF052176 Hs.158529 Homo sapiens done 24457 mRNA sequence 1.6 

300674 AW467388 EST cluster (not in UniGene) with exon hit 1.6 

315193 AI241331 Hs.131765 ESTs 1.6 

319713 R24204 EST cluster (not in UniGene) 1.6 

20 301210 AI379982 Hs.158944 ESTs 1.6 

309365 AW072861 EST singleton (not in UniGene) with exon hit 1.6 

321403 AW451454 Hs.247568 adenylate kinase 3 1.6 

321908 AA376936 Hs.20998 ESTs 1.6 

303349 AA382661 EST cluster (not in UniGene) with exon hit 1.6 

25 324338 AL1 38357 Hs.247514 ESTs 1.6 

310599 AW300144 EST cluster (not in UniGene) 1.6 

333193 CH22_FGENES.98_15 1.6 

336433 CH22_FGENES.825_12 1.6 

312097 AI352096 Hs.157169 ESTs 1.6 

30 311445 AW204237 Hs.1 92703 ESTs; Weakly similar to!!!! ALU SUBFAMILY J WARNING ENTRY 111! [Ksapiens] 1.59 

317736 AI351722 Hs.192410 ESTs 1.59 

308147 AI498991 EST singleton (not In UniGene) with exon hit 1.59 

313489 AA017492 Hs.135655 ESTs 1.59 

316289 AA902488 Hs.122952 ESTs 1.59 

35 326983 CH.21_hsgi|5867657 1.59 

314781 AW205298 Hs.202372 ESTs 1.59 

328397 CH.07_hsgi|5868397 1.59 

331970 AA461084 Hs.187677 ESTs 1.59 

321744 N91419 Hs.12028 ESTs 1.59 

40 310509 AI292181 Hs.150036 ESTs 1.59 

315921 AI147545 Hs.114172 ESTs 1.59 

322049 AI928242 Hs.144383 ESTs 1.59 

301161 AA731518 EST cluster (not in UniGene) with exon hit 1.59 

300548 AI026836 Hs.1 14689 ESTs 1.59 

45 319142 F07366 EST cluster (not in UniGene) 1.59 

313526 AW152263 Hs.249243 ESTs 1.59 

305937 AA8B3238 EST singleton (not in UniGene) with exon hit 1.58 

330123 CK.19_p2Qi|6671869 1.58 

327819 CH.05_hsgi|5867968 1.58 

50 318250 AI478814 Hs.134603 ESTs 1.58 

306760 AI034094 Hs.169476 tubulin; alpha; ubiquitous 1.58 

322358 AA220235 Hs.246836 ESTs 1.58 

317866 AI690269 Hs.201345 ESTs 1.58 

320725 AA703319 Hs.120967 ESTs 1.58 

55 311332 AW292247 Hs.255052 ESTs 1.58 

334893 CH22_FGENES.452_7 1.58 

318730 AA398215 EST cluster (not in UniGene) 1.58 

315889 AW271639 Hs.221744 ESTs 1.58 

303702 AW500748 Hs.224961 ESTs; Weakly similar to 73 kDA subunit of cleavage and polyadenylation 

60 specificity factor [H.sapiens] 1.57 

315086 AI492660 Hs.170935 ESTs 1.57 

332514 AA156499 Hs.8454 protein kinase; cAMP-dependent; regulatory; type II; alpha 1.57 

335549 CH22_FGENES.576_10 1.57 

329532 CH.10_p2gi|3983505 1.57 

65 323140 AA180467 EST cluster (not in UniGene) 1.57 

313166 AI801098 Hs.151500 ESTs 1.57 

337896 CH22_EM:AC005500.GENSCAN.56-3 1.57 

330658 AA319514 Hs.211093 ESTs 1.57 

324585 AI823969 Hs.132678 ESTs 1.57 
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Hs.208231 EST 



306929 AI124514 
338083 

316868 AI660898 

310937 AI472880 
10 328638 

310074 AI6S1039 
327058 

320076 AI6S3733 

322345 AF086529 

15 314731 AI745498 

318687 H49619 

303841 AI934464 

302370 AJ009849 

322571 AF156271 

20 318050 AI052093 



Hs.148559 
Hs.204079 



Hs.1 99297 
Hs.133132 



318807 I 
334287 

30 311928 AW024798 Hs.233374 ESTs 



CH.19_hsgi|5867307 

1 ESTs 

EST cluster (not in UniGane) 

EST singleton (not in UniQene) with exon hit 

CH22_EM:AC005500.GENSCAN.174-1 

2 ESTs 
0 ESTs 

_ CH.07_hsgi|6004473 

CH^1_hsgi[6531965 
ESTs 

EST cluster (not in UniGene) 
ESTs 
ESTs 

EST cluster (not in UniGene) with exon hit 
Homo sapiens GNAS1 gens encoding NESP55 
EST cluster (not in UniGene) 
ESTs 

EST cluster (not in UniGene) with exon hit 

EST cluster (not in UniGene) 

CH.07_hsgi|5868388 

CH.Y_hsgi|5868874 

EST 

CH22_EM:AC005500.GENSCAN.528-1 
ESTs; Weakly similar to KIM0346 [H.sapiens] 
EST cluster (not in UniGene) 



.17 



314165 
302868 
314034 
40 325389 



AA682913 

AA603092 

AW502851 

AW501163 

H85709 

AA761265 

AA157392 

AI299137 



303347 
315769 
45 317031 



AA744875 
AA973297 
AI827065 
f T26438 
J AW160507 
7 AW138174 
) AF086386 
> AW411383 



312701 A 

304787 A 

55 305849 A 

314557 A 

316507 A 



Hs.1 62017 EST 
Hs.2471 79 ESTs; Weakly similar to KIAA0319 [H.sapiens] 

EST singleton (not in UniGene) with exon hit 
Hs.249978 ESTs 

EST cluster (not in UniGene) 
Hs.21371 son of sevenless (Drasophila) homolog 1 
Hs.221281 ESTs 

EST cluster (not in UniGene) with exon hit 
Hs.154214 ESTs 

CH.12_hsgi|5866921 



ESTs 

EST cluster (not in UniGene) with exon hit 

ESTs 

ESTs 

ESTs 

EST singleton (not in UniGene) with exon hit 

EST cluster (not in UniGene) 

ESTs 

EST cluster (not in UniGene) 
ESTs 

CH.11_hsgi|5B66903 
ESTs 

EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 



Hs.189413 
Hs.126101 
Hs.224877 



Hs.1 30651 
Hs.1 69688 
Hs.1 28127 



) AA513406 
T Z44354 
5 W27919 



324573 A 
3130S2 A 
65 324696 A 



8 AI459140 
S AW151933 
8 AI146423 



Hs.158759 
Hs.161942 
Hs.212827 
Hs.257339 

Hs.129109 

Hs.146709 



EST 
ESTs 
ESTs 
ESTs 

EST cluster (not in UniGene) with exon hit 
ESTs 

EST singleton (not in UniGene) with exon hit 
ESTs 
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315674 M851923 
321861 N79341 
310890 AI184510 
330035 

316907 AA843868 
312299 M972712 
331128 R51361 
305177 M663591 
337685 



317916 AI565071 
332901 

305413 M724659 

316707 AI016387 

313693 AW469180 

316101 AA922236 

320796 AF038966 

307451 AI248615 

323648 AI679968 

331482 N27515 

318059 AI023175 



Hs.191850 ESTs 

EST cluster (not in UniGene) 
Hs.143728 ESTs 

CH.17j)2gi|6042048 
Hs.190567 ESTs 
Hs.174818 ESTs 
Hs.23423 ESTs 

EST singleton (not in UniGene) with ex< 

CH22 EM:AC000097.GENSCAN.77-1 



EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 
3 neural precursor cell expressed; devebpmantally down-regulated 1 
CH22_FGENES.534_7 



Hs.184406 
Hs.170651 
Hs.221037 



314740 AW015667 
314117 AA224368 
301648 AA313954 



Hs.152060 
Hs.40296 
Hs.167022 

Hs.230213 
Hs.1 19427 
Hs.185164 



EST singleton (not in UniGene) with exon hit 
ESTs 
ESTs 
ESTs 

secretory carrier membrane protein 1 
EST singleton (not in UniGene) with exon hit 
ESTs 
ESTs 
ESTs 

CH.16_hsgi|5867142 
ESTs 
ESTs 
ESTs 

EST cluster (not in UniGene) with exon hit 
CH22 EM:AC005SO0.GENSCAN.513-10 
EST singleton (not in UniGene) with exon hit 

3 ESTs; Weakly similar to REGULATOR OF CHROMOSOME 
CONDENSATION [H.sapiens] 

0 EST 

4 ESTs 

1 ESTs 
CH22 FGENES.719J0 
CH.14_hsgi|5867028 
ESTs 

ESTs; Weakly similar to mCAC [M.musculus] 
EST singleton (not in UniGene) with exon hit 
CH.07_hsgi|5868395 

EST singleton (not in UniGene) with exon hit 
ESTs 
ESTs 

EST singleton (not in UniGene) with exon hit 
EST singleton (not in UniGene) with exon hit 
tousled-like kinase 2 
ESTs 
ESTs 



308501 AI685263 
312330 AA635305 
318040 AI018150 
336205 
325701 

315009 AW1 89460 
303121 AW407585 
309271 AI986221 
328385 

307700 AI318545 
314591 AW103292 
304484 AA432067 
304382 AA232873 
304232 W52674 
309853 AW298169 
312504 AW207346 
313134 N63406 
330391 AF015950 
314342 AI873046 Hs.258775 ESTs 

305977 AA387293 EST singleton (not in UniGene) with exon hit 

301165 N85789 Hs.224155 ESTs; Weakly similar to PTERIN-4-ALPHA-CARBINOLAMlNE 

DEHYDRATASE [H.sapiens] 
300613 AI932294 Hs.249604 ESTs; Weakly similar to B-CELL LYMPHOMA 6 PROTEIN [Rsapiens] 
324124 AI554212 Hs.185664 ESTs; Weakly similar to SERIN EffiHREONINE-PROTEIN KINASE NRK2 [H 
ESTs 



323909 AL043148 
315464 AW139500 
303700 AI022056 



311045 A1569399 
315010 AA531082 
310205 AW025248 
310759 AW135924 



Hs.174181 
Hs.186257 
HS.1 16135 



Hs.174746 
Hs.240049 
Hs.202445 



ESTs 

EST singleton (not in UniGene) with exon hit 

CH22_EM:AC0O550O.GENSCAN.107-1 

EST singleton (not in UniGene) with exon hit 

ESTs 

ESTs 



285 



310954 AW449044 


Hs.171293 ESTs 




312019 T77046 


Hs.188750 ESTs 






CH22_FGENES.430_5 




332043 AA490831 


Hs.125055 ESTs 




322950 AA296219 


EST cluster (not in UniGene) 






CH22_EM:AC005500.GENSCAN.67-3 






CH.09_hS gi|5868533 




309245 AI972447 


EST singleton (not in UniGene) with exon hit 




312172 AI222168 


Hs.191168 ESTs 




304039 T47349 


EST singleton (not in UniGene) with exon hit 




301329 Al 149653 


Hs.190496 ESTs 




313376 AI949246 


Hs.200381 ESTs 




324248 AW504918 


EST cluster (not in UniGene) 




308771 AI809301 


EST singleton (not in UniGene) with exon hit 




334935 


CH22_FGENES.464_3 




319764 AA019827 


EST cluster (not in UniGene) 


1.5 


318519 T27135 


EST cluster (not in UniGene) 


1.5 


332807 


CH22_FGENES.7_9 


1.5 


322310 AF086376 


EST cluster (not in UniGene) 


1.5 


324557 M489166 


Hs.156933 ESTs 


1.5 


332118 AA609585 


Hs.162689 EST 


1.5 


319539 R09027 


EST cluster (not in UniGene) 


1.5 


313149 AW291092 


Hs.201058 ESTs 


1.5 


329722 


CH.14_p2gi[6065785 


1.5 


323514 AA861209 


EST cluster (not in UniGene) 


1.5 


308078 AI472621 


EST singleton (not in UniGene) with exon hit 


1.5 


337965 


CH22_EM:AC005500.GENSCAN.1CO-10 


1.5 


335905 


CH22.FGENES.635 13 


1.5 
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TABLE 14A shows the accession numbers for those primekeys lacking unigenelD's for 
Table 14. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 

Accession: Genbank accession numbers 



Pkey CAT number Accession 



2 46678J 

2 212379.2 

3 199797J 



313733 441212J 
322215 47002J 
322235 47070.1 



322347 47545J 

322370 187612J 

321739 43998J 

321781 151177BJ 

314570 280469.1 

300129 635249.1 

322452 497108.2 

321861 1651920.1 



322574 39412.1 
314753 311451.1 
300370 3910.2 



AF085833 R69689 AW341677 AA923375 BE327566 AW630415 R69601 AW615339 

AW962489H64300AA329527 

AA284333 AW4681 19 AA284334 AA810992 

AB040928 T94573 AI289313 AI536039 Z44366 BE141499 D601 16 D61488 D59945 AA419503 R28090 R72986 H03255 
AI1891 12 AI912312 AW51 1018 AI401349 AW470144 C14624 AI335797 Z40300 AI014456 D60269 D601 15 T16722 AI370673 
D50270 

H53744AF075088 H53797 

BE0O4271 AI248023 A1022157 H71999 

AA766346 M809877 AA836116 AW469598 AW977404 

AF038005 N51816 N51731 

AF086106 AI193589 AW665594 N71795 AA722627 AW655373 AI3O0251 

AW812795 AA419617 H87827 AW299775 AW382168 AW3821 33 BE171659 AW392392 BE171 641 AA541393 
AA766825 AAB1 1180 AA085906 AI762946 AW977820 
AF086376 W77804 W72689 AA837735 
AF086386 W77947 W72708 



AF086467 W81444W81445 

W95298 AF086529 AI9121 90 AW294159 AI458747 W94782 

AF086538 W95969 AI63191 1 W95835 

AA330095 W25112AA249401 

AL080280 T73124 H02689 AL080281 

D78667D78871C18258 

AA9Q4776 AA405696 AA405962 

AW028820AI219068 

AI147202 W56755 W56710 

N79341 N99082 N47551 

AA1 80467 AA449184 AA464831 AA505048 

T55958T57205AF147346 

AA011603 N58604 N58611 

NM.016102 AF156271 AA781868 AW152318 AW770403 AA909463 AA48299S AA758672 
AF156548 AA639797 AI675267 AI825497 AI823355 
AA463262 AA463615 AW160405 AW407583 

AW136181 AA581939AK001221AA694538 M424043AI016272AA098960AA884473AI356180 BE391633 AA437086 
AI277866 AA098827 AA992680 BE172624 AA424101 AA320776 AW962957 N77431 AW858960 AW858897 T85649 
AA357743 AI82781 7 AI905672 

AI082395 W92924 BE048524 AW005302 AI084474 AI369330 AI82771 0 AW1 35506 AW298694 

AW160507 NM.013367 AF191338 AA384939 AI445790 AA730309 BE397003 BE267753 AI979163 N50386 AW583671 

AW583608 BE074466 BE074479 BE074471 AW976283 AA604393 AW162122 W73648 AI823475 N7589B W7371 3 

AW470099 AW513236 AW025055 AW613115 AI923379 W58081 AW664525 AW196795 AI143619 AI565152 AA025406 

AA505846 AI685494 AA829964 N59156 N59163 R15442 AAB26919 AI610221 AI200120 AA503279 AW150822 AI189513 

AI807122 AI016368 AI335868 AW683389 AI193892 AI956157 AI628879 AW591589 AW5B3446 AI955406 AW148396 

AI340255 AI867942 AA748525 AA876991 Z38516 AI874002 AI869474 N63100 AA429094 AA082443 

AW105663 AA693880 AW517398 AI768507 BE220851 AW978538 AA831489 

BE219300 BE327455 AL134620 R36741 R17996 

AL031709 AI249061 AA907658 AI420444 
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316141 423880_2 
323371 117336_2 
307700 30923J1 



307783 697809J 
301161 427238J 
324094 270098J 

309023 4737J 



322878 117013J 
301239 457668J 
40 301256 16720J 



300611 337193J 
324157 247225.2 
323509 967739J 



AW303457 AA972713 AA724265 

N45114 N51465 BE087338 AI083551 AL135118 BE395609 

BE280998 BE254670 BE294951 BE564979 AW405364 AA069256 AA129837 AI559667 BE281405 AW410850 BE041153 
A1254811 AW301 340 AI61 3335 AW301411 AI609469 AI611607 AI611616 AI377623 AI335509 AI613544BE043165A1371663 
AI340452 AI612066 AW072890 AI254558 AI349884 AI370095 AI613383 AI61 1946 AI613353 AI307414 AI318229 AI612685 
AW305327 AW268924 AI370063 AI349292 BE049068 AI369098 AW274098 AI344845 AW075187 AI053401 AI345220 
BE138515 AI613386 AI583302 AW301955 AI349661 AI307432 AI054168 AI223913 A1612081 AI348942 AI334539 AI309366 



AI492709 AI335418 AI053999 AI366989 AW073478 AI247058 AI249584 AI305875 AI308585 AW071272 AI271487 AI340719 

AI366995 AI223673 AW271066 AI61 1936 AW071296 AI270796 AI254385 AI251393 AI252562 AW268236 AI254858 

AW071317 AI309102 AI609397 AW268971 AI583267 A1792484 AW075168 BE138443 AI254126 AI309822 AI310872 

AI61 1953 AI251054 AW276658 AI335405 AW075039 AI31 1768 AI612028 AW271895 A1612005 AI312240 AW271082 

AI371642 AI334879 AI310194 AI310772 AI345419 A1334675 AI223914 AI284707 AI284813 AI349140 AI254853 AI313094 

AI310170 AI309499 AI312476 A1376484 AI335467 AI340802 AI309815 AI310168 AI611446 A1345824 BE327775 AI318545 

F17185AW614950 

AW998989AI613519 

AI347274AW844024 

AA731518AA765714 

BE395109 AW663898 AW237041 AI492154 BE046906 AI651285 AI983290 AW002590 AI201040 F32424 AA992272 
AW271836 

AF1 80681 NM_015313 AA229509 AA225792 AA216413 AI888045 BE005205 AB002380 T55518 BE276097 AW380669 
BE1 42836 AW370976 AA479384 R96425 AI680999 AA5951 38 H54582 AI022709 T55440 AI041769 AA861 144 AW392028 
AM79287 M824634 AI638446 H54691 R96382 AA770352 AI640467 AW293491 AA779138 R28298 AA970562 C15590 
R84455 AA020769 AL036394 H80566 BE548861 AA301207 AW959414 AI284253 AA043173 W52429 BE544571 R24852 
Z42603 F13120 R24340 R24326 T75305 H70110 N56255 AA334210 F11453 AW947285 H80345 AA298992 AW380931 
AI267175 Z45421 AW380981 W861 13 AA663590 AA167577 BE566760 BE169166 AA449904 AA459205 N31126 W03564 
N31 208 AW993277 N44765 AW605275 D61449 W68572 AA258190 D60496 AW992964 U46277 H04097 AA370360 
AW957211 AA159775 AI631243 H83367 H21671 D61077 AW392712 N21112 H98522 N45298 N83629 AI393509 AW022043 
AA744886 AI580482 AA723286 AI422244 AI423984 D62804 AI088349 AA587890 AH 44172 N33275 BE074397 H03399 
D62578 AI056639 AI829918 AA579584 AI089460 AI350124 W68573 AI580828 H98897 AI570468 H83715 W861 1 4 AA923123 
D57446 AA043174 AW337721 AI266551 AI140017 AW022356 D79855 D79650 D79393 D60495 AA788666 AAB93443 
AW516977 W60139 AI628156 AW473223 AI608892 AA159670 AW440366 AI421529 T50751 AI174374 AA912234 AA724248 
AW7804O0 M907218 H80514 D57452 AA863419 AA552618 D29S14 R44556 T16452 R44935 Z41 132 D29188 H69692 
AI250176 AI078860 AA370359 AW183108 H74200 AA258183 F10723 C00323 R86148 AA860570 AW130073 AL079946 
AA410327 AA532614 AA234500 AI151507 AA410288 AW969839 AA483232 AI3B3200 AA236540 AI807672 H73441 
AA262442 AA768862 AA262443 

AA827650 AA827652 AW629526 BE044E85 AW974451 AA761439 M848505 AA765803 
AA0B1820AA082191 AAQ79811 
AA807558 AA8271 17 AW629567 

NM.O16603AF251O38 AI124624AA776579 AW298470AI304868AW082724 AB48442 BE218336 N20641 AI018013 
AW858832 AW978157 AA815187 AA932948 AF157316 AI444958 W00848 W02935 AI434933 N26335 AA428681 AW371059 
AI651612 AW134937 AW968911 AA488815 AU57523 W48766 AW936954 AW936941 AW579205 AW936886 AW936889 
N74541 AW936953 AW578421 AW604352 AW367088 AW849258 AW849453 AW371 606 AI5S4921 W49785 H99814 
AA805957 AA904606 AW206696 BE169229 AA333951 AA190704 AW936944 AA463219 AA430306 AW805704 N48503 
BE222307AI638612BE550O45AI8053O4AI69O987AA776841 H12690AW183731 AI380760 AI636261 AA812641 
AW592656 AI686132 AA843424 H99220 AW084996 AW128879 AI800871 AA610135 AA191524 AI150076 AI474530 
AA748461 N29013AA746372 N59606 

N75450 AA877636 AW137945 W05248 AA514763 AW972399 AI758397 AW195051 
AW402931 BE393099 
AL036947 T93676 T85475 

AA641735 AA2818B1 AA861209 AA934756 AA835887 AA641795 AA748822 AW295703 



322957 29014J 



324231 975669J 
324248 977901J 
323691 221757J 



AA3Q1270AA301379AA301366 

R85652 AA114024 AA296219 AA375304 AW963796 AW885952 AW020969 AA114025 AI804930 BE350971 AI765355 
AW317067 AW974763 H85930 AW172600 AI310231 AW612019 D62908 D62664 AA652738 AI674617 AI494064 AW138666 
AI147620 AI147629 AW611793 AI668922 AI971005 AI864742 AA174171 

AK001701 AA134337AA356202BE163251 AW875175 AW875181 AW875177BE163389AKOO0741 AA247755AA120819 
AW868040 AA3091 18 AW962348 AA4712B7 AW996843 AK001452 BE005344 BE617899 AA186588 M120820 AW36331 1 
AA648105 N71529 BE168417 AW673900 AI858160 AA134338 AA659697 N22162 AI335437 AI31 1237 AI343171 AI336661 
AW26B074 AW274348 AA935005 AW576295 AW26262B AW593153 AA730055 AA662650 AA782687 AW894855 AI933533 
AW193002 AW899448 AW890142 AW812670 AA085664 AA334191 BE178085 BE180553 AA389680 AA984772 AA442527 
W26560 BE384359 AA847210 AW304931 AI669606 M085613 AW197240 AI632828 AA581646 AW129348 AI017643 
AW089030 D20893 AI3B2955 AI557148 AW499979 
W60827 AL079968 AL047234 
AW504918 N55410 AL1 18584 AW839266 

AA317561 AI793O0OAW235111 AI793178 AA767397 AI263113 AA719462 
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324330 300543J 
5 300815 41537_2 
324349 1154015J 
323715 225129J 



325031 266373J 
325045 1534945J 
324473 38795J 
323827 235506J 
302270 1734192J 
301618 10967_5 
301646 42154 1 



AA737345AA682286AI799378 
R05385AI061251 
AL1 18754 AA3332Q2 K38001 
AA8B4766 AW974271 AA592975 AA447312 
BE152396 BE152395 AA287515 BE001834 AA286678 AW406477 
AW501470AW502931 A\ 



AW009312 

AA833858 AW978090 AA327679 AA810436 

AF286598AW075342AB028994AL043713AW378914AA340650N57166AW956914R17961 AA336481 BE393734 
AW977867 AW294638 AA927857 AA961627 AW303969 AW894416 AA8121 19 AA912758 AA424355 AA490582 W30941 
AM76693 M131029 AA127777 AL043714 M496984 T511 17 AA127722 AA594012 AI492876 N76483 AW1 19061 BE464926 
AW303419 AI972370 AI768172 AI826550 AI435432 AI37951 6 AA778421 AI276089 AA424521 N59361 AA723153 AA723176 
AI867487 M090677 AI827221 AI351027 W02732 AI810729 AA142848 AI0821 10 N59379 N29744 AI283747 AI148665 
AW779845 AI382967 F34319 AI369934 AI282438 AW183449 AA863467 AA813469 AI092645 AI870701 AA863119 
T65475 R07576T17017 F08143 Z43546 
T08845 Z43538 F06691 

BE560824BE513941AW238907AA580852AW501176BE241846AW501163AW751433AW501340BE241715AI910774 
AW406878 AW965560 AW966151 AW966496 AA336174 AA335376 AA335537 
R56151 W91936 
T52761 T52760 

AJ277841 AI630669 AI804370 Z41 939 AW751251 AA299456 Z44739 AW860471 Z30158AW1 05391 H56997 W84688 
AA491201 WB4636 AA706815 AI131055 AA4S3536 AI005075 AW340034 AI332372 AW1 18195 AI338932 AI191968 
AA693932 AI189982 AI193225 AAB84163 AA594562 W37747 AA249754 M746131 AI916540 AI832188 AW946555 
AA833838 Z40564 M861563 F01447 AA887937 AI933559 AW973250 AA566018 AA313954 
AA354146AI184230AA643525 
AA492588 AA492498 AA492571 



324614 215437J 
324561 385257J 
324685 41003J 



324692 351987J 
40 316893 473541.1 
303027 21796J 

324715 290035_2 

45 324771 385085.1 



65 302697 43219J 
309917 57485.2 



AW207734 D60164 D81150 D81078 D61356 AW996804 
AW503101 AA3091B4N56323R7099B 
AW504161 AW503601 AW505509 

AF226667AA207()32M1()0804M121287M488316AI808218AW419048AI911097AW132123AA502311AW0899^ 
AA1 00952 AI075431 AW083432 AI990554 BE466029 F28643 AF086422 W79581 AW439007 F37179 W79780 AW439035 
AA731381 AW750380 AA251012 AW589846 AA730238 AA329792 AW087255 AA220982 AA082469 AA877260 AA232380 



AA557952 AA677593 AA618150 
AW979189 AA837332 AA856946 AA876935 

AF111178NM 005708 AF105267 AW590040 AI979280 AA001322 BE146329 AA702430 AA702429 AA694221 AI206348 
AI206285 AW770197 AA923032 AI379586 AA701 165 AW594643 AA001909 AW002368 

AI739168 AA426249 AI199636 AW505198 AW977291 AA8245B3 AA883419 AA724079 AI015524 AI377728 AW293682 

AI928140 AA731438 AI092404 AI085630 AA731340 

AA631 739 AA768584 AW1 34477 

AA640770AI683112AA913009 

AF090948 AI064898 AI1 1 1 182 

AB018257 BE148640 AA081832 AK001915 AF150217 AF161350 AI219174 AW074664 D60040 AA346065 H28750 
AW151783 BE613360 BE612628 BE502031 AW183790 AA992580 AA505815 AI310432 AI678015 AW592679 AA879181 
AA806708 AI744110 H24681 C16064 D62900 AI285033 AA346064 AI865123 AW467798 BE221231 AL120676 N89877 
AI928370 AI358387 AA748486 AV64747B AV647450 AA312313 AI279340 AW505099 
AA005122H49792 
AA476777 T86049 

AA437414 AA131479 AA086182 AB037775 AW161C63 AW514393 AA332331 AW136197 BE150789 AA425533 AA249605 
N88308 AI016201 BE004662 AA291027 R57587 AA424277 AA476391 W07532 T97036 AA218898 AW162629 R57770 
W01 278 W90204 W901 56 AL1 1 9197 R8451 3 AA2801 03 AA334994 AW965504 AA460868 AA447470 AW1 38594 W38898 
W90028 AI078353 W90078 AA699696 N35523 AA704225 AA035059 AW134892 AA1 15140 AI142854 H90084 AA826342 
AA460694 N46339 AA425344 N56953 AA035569 AI761 083 AI658696 AI524818 AI338965 AW069249 AW299871 BE464061 
AI189720 AW340682 AI423380 AI275122 H17532 N80735 AA826343 AI039694 BE328398 AI192947 AW271286 AI623122 
AI922902 AW293087 N22141 AA730657 AW316610 N26473 F06663 Z43610 H14783 R59761 H11540 AI265915 AI681773 
AI091748 BE22063S AW841861 AI702181 AI468447 AA907544 AI273941 AW244034 R37769 AA446663 T96929 BE045884 
AA476341 H89994 H29043AW051211 N49522 AA306977 

AKQ0Q738 AA3474S2 AW961713 H70832 AI750643 AA362887 AW955588 W44974 AA279599 AW298762 AA452666 
AA443355AI337273AA446931 AI752977 AA661554 W42674 AI292172 R41163 AA621381 AI244157 
AJ001409AJ001410 
AW340014 AW866993 AV651649 



AW300144 AI338491 AI798381 BE220076 
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318455 606341.1 
317850 363835.1 

303431 32082.1 



319127 1653640J 

303480 232749J 

303481 31534J 



AW250553 L07876 Z36843 R30693 AI190097 AW965317 



318519 434741.1 
304168 72494.-10 
302948 21445J 



318704 799152 1 

318730 275116.1 

303714 1155758J 

304387 183612.1 

304398 10169.1 



318807 1536467J 
319478 765461.1 
318872 1534581.1 



AI681545 AI951714 AI570397 AW873588 AA836396 AI359986 AI499790 M773477 AI951615 T07547 AW304709 AF1 14041 
BE176629 Z44580 T30422 T32690 AW953065 H1 0602 

NM.000539 AA01901 3 AA01 9367 AA056154 H38735 AA057003 M021 051 H381 02 AA015774 AA059291 M019439 H84843 
H83375 AA019914 AA017288 R84449 W26519 H38258 AA018736 H84U7 AA018577 M059353 U49742 H38767 AA318341 
AA317553 H86646 H91989 AA317398 AA317378 W29024 W23034 T27877 AW950059 AA017195 R84262 AA057177 
H89941 AA01 9904 H84662 AA015775 AA019368 AA020976 H37900 C20733 H38682 H851 97 M018578 AA017252 
AA019440 AAO59059 H38651 H84148 AA018560 W25754 C20752 AA317915 AW9521 15 AA317369 M019845 R85402 
AA019492 AA017196 AA056093 AA056094 AA058836 AA056155 W25957 W23027 AA056159 W23043 W21890 W28951 
AA317978W26459AA317265 
N49476 Z45911 R21061 
AA331906 AA332484 

AK001952 AA336839 AW249271 BE247287 AF182002 BE613472 AW962673 AA332235 AW849937 AW849814 H49893 
AA477148 AW968944 AF182003 AW007897 BE246145 W76100 AI480141 AW410205 AA609339 A120911 1 AW000979 
AA330280 AW961554 W72865 H49894 AA514317 AA620407 AA504522 AW472833 AA716609 AW129282 AA347351 
AA628378 AW589860 A1636696 AA464632 AA464533 AW874189 AA757076 AA479654 AW517910 AW292357 AW872638 
AW26228B AI910666 AW513749 AW238771 AA215797 BE387073 

BE143533 AW850432 AK000042 AA333666 AA385314 AW966616 AW793068 AW793414 AA361103 AW390841 M040095 
AW385058 AW799162 AI3331 15 AI990745 AI653703 BE503693 AW150758 AI949919 AW190450 AW512348 AI625970 
AW501057 N52954 AI281 378 AI4O1710 AI648409 AW002659 AI687639 AI093943 R33960 AA040062 AI926267 AI240425 
AI520911 AI093428 R52943 

AI040372 AB040915 W40569 BE158910 BE158914 D63226 AW025860 AW583088 AA334307 AA210942 AW753212 
AW805322 AA362635 BE158911 AW891225 AW994862 AA805451 R28541 AA229347 N48266 AI377788 R28682 R36122 
AA811941 AI240742AI632001 T99965 W01976 AW891205 AW891177T97433C15571 AA346850AA504293 W07500 
AI694503 M489216 AA327725 AW959917 AA694146 N68514 AI076285 AW016246 T07783 AA642400 AA716133 AAB05332 
R00312 AA705021 AW498605 AW891723 AW891906 AA808025 N29039 N74897 W60393 AA810184 AI627460 AW057516 
AA807436 AA760966 AI359295 N78642 N20662 AA830300 W81705 AA832258 AW891718 AI811796 AW515523 Z41735 
AA449978 AW891714 AI684539 AW891896 AW071701 AI890916 AI924994 AI039743 AA8B8524 AA244214 AI015736 
AI270105AI865077 

F30712 F35665 AW263888 AI904014 AI904018 AA336927 AA336502 
H08370 Z46168F07366AA193168AA193138 

AK000290 AI476034 AA465309 BE148761 AW303607 AW958665 AW469635 AI819365 AI243857 AW469326 AA1571 10 
AA278626 AA496257 AA306656 F29732 AA831859 AA312210 AA564476 AA579065 AA769522 AA740386 AI205635 
AA491643 AA81 0400 AA41 7708 AI567332 AA157392 N53817 M374229 



H13364T27135 R61679 AA746905 
H77679 

AB038995 NM.016530 AKQ01 1 1 1 AA465635 AW968716 U66624 AA885459 AA703019 AI040266 AI018689 AI692886 
AI125372 AI376796 AI192040 N58161 AL133607 AW503673 AW505479 AA362265 AJ404671 
F11623H17552 M347728 

BE311816 AKO0O916 AW868037 AW868039 AF228527 AI752482 AW868041 AA077049 AI201537 W55873 AA205019 
AA077918 AW968729 AI978328 AW139S20 AI093053 AW204025 AI418805 AA598926 AA586345 AA045669 BE314455 



W01 166 AW996900 BE1 84300 Z44887 T34535 R51495 AW886575 AA295490 AA295162 AA295163 AW937125 T56951 
BE386106 W52674 

AW500106 BE241915 AW503971 NM.016542 AB040057 AA313812 AK000556 W16504 AI822088 AA259107 AA191319 



AW673897 AW953685 N22323 AA649166 A1377099 H03061 AI660072 AW276405 AA809779 AI803430 AW297484 
AW510384 AA814816 AA371522 D63035 M953567 R79392 R24282 AA876831 AW297542 AI699023 AA992652 AI041436 
AI631602 AW589676 Z28684 Z24981 



AW501336AW501337 
AA236027BE003275 

AA1 95509 BE394661 AV660757 AA489161 BE165972 AW503705 AA262785 AF123320Z78357NM.014171 AF161488 

AA248971 BE568575 AA461410 AA165108 AI637731 H75454AA372934AW339334BE568754BE564697BE567299 

AI681606 BE537269 AW197204 AA290890 AI189393 AW292463 AW470227 F27399 AW61 1942 BE566888 AW301701 

AI675761 AI628429 AA16471 1 AI797753 AI656879 AI912690 AI675277 AI695099 AI094095 AW014158 BE091O59 AI201748 

AW236961 AI038003 AI083606 M401606 AI0794O5 AI073516 AI655537 AA401475 AI814532 AI079862 AI093789 AI422084 

AI216476 AI392760 AA926998 AA781782 Z25198 A1086377 Al 185511 A1185539 Z28843 AI223792 AI379563 M706253 

AI433798 AI921885 H75455 AW025269 AI224100 AI083611 AI225057 AW1 96334 AI572254 AA761628 AI472801 AA283784 

AA830149 AW978407 M85983 AW503637 

W00973 N56457AW992226 T84921 R01342 

R86913 R86901 H25352 R01 370 H43764 AW044451 W21298 

F08434Z42573 H28810 

A1524124 R06841R06842 

Z43108F06295R13085 
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303841 79133J 

303889 17771 83_1 

319539 63198J 

318905 1536408 1 

320187 39S254J 

318998 65715J 

319635 163534J 

319699 747196J 

319713 1699356J 

319761 75324_2 

319764 88596J 

319808 7069_3 

321040 193331J 

320409 43709J 



319881 1585983J 
320488 368456J 
321121 1545647.1 
321205 81249J 
321253 375160J 
314043 155125J 
320630 17685.2 
313435 443527J 
313443 82292.1 
313472 82811.1 
321348 41762.1 
314138 179960J 
320712 57156.2 
321383 41924.1 

312996 187327.1 



AA742999 Z43272 AA345258 AW956677 AA031942 

W19657 BE616760 BE259848 BE382680 BE615587 AI934464 M322745 T07155 AW961174 AA307302 Z41888 M621992 

M188400 AW770608 AI147458 AI14840B AI696291 AA972591 

T19204T36109T36107 

R09027 AA344892 AA329574 AW955648 AW978708 AI567804 AI378935 AW014657 AI804134 R08922 N92947 BE546788 



Z44266 H06384AV655948 

R17531 AW960899 AA338366 AW673294 BE047729 BE047722 AA330746 AW841797 H05030 AI142105 R12654 

AI458682 H24240 R14537 R18426 AW867082 

R24204R15712T84695 

AW630974 BE005208 R84237 AA724997 AA334867 AW955777 R18816 

AA019827R18947 H46852 

T58960 AA609180 AA621130 AI927236 AA431075 



AA226869 AA29651 6 AW959753 AA186390 AL35961 9 AA3561 95 AA148427 R22748 AIQ33624 BE548853 H95327 
AW579751 BE561649 AA397533 BE617136 AA236444 T89946 AA247450 N55777 W38725 AI743846 AI80B406 AA922229 
AI051464 W04713 R1 1251 W19656 AI042319 AA489276 AI224533 H95274 AW269958 T8931 1 AI890088 AI862754 
AI830968 AI669336 AI589780 AA534557 AW273839 AI338155 AI126632 N83542 BE046048 AA807028 AA848107 
AW1 67978 AA976930 M148428 AI289304 AI524262 AI625961 AA773469 AI222288 AI280054 AI242371 AA227222 
M973329 AA29651 7 AA829436 AA234526 AI149769 AI567865 AA936939 AI590681 AW469308 AI689531 AA486419 
AI422051 AI057252 AA626941 A1475352 AW247913 AI222370 M670122 AW198034 AA486418 AI363794 AA380739 
H51299 H44619 H46391 R86024 H51892 T72744 
A1817336 R32883 AA595590 AI743065 R31386 
W23285 H42714 F25381 F37215 
AA002047 N72537 H54142 H81580 
AA610649AI699484 H59558 
AA827082 M732246AA167611 AA830741 

AA199847 AA410224 R53323 AW936567 AW936569 AW936568 AW936571 
M769123 AA831715 AW977666 W92553 



AA007374AA00746BAI816886 
Z49979 D61703 U30168 
AA740616 AA654854 AA229923 
R66867 R65678 R82673 W73128 R83101 

AW968556 AJ238555 AW968731 AJ002574 AA459446 H70260 AW977557 M767351 AW268572 M81 071 9 AI698677 
AI30Q460 AA907450 AA649224 T07415 AI536896 BE01 8515 AI279865 BE047421 
AW368634 AI702169 AI245179 AW368646 BE545574 AA249018 AW368633 N27553 



AA991705 



306855 

329722 c14j>2 
50 329728 C14J32 



55 308352 
308383 
308521 



AI475949 
AI498991 
AI124514 
AI610791 
AI624497 
AI689808 
AI701559 



303019 41850.1 

303084 44211.1 

305092 AA642912 

305169 

305177 

305235 

305413 



AF174008 AF174027 AF174106 
AA663131 
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307451 
S07513 
307848 



AI185234 
AA8B3238 
AA8B7293 



325417 c12_hs 
325450 c12_hs 
325452 c12_hs 



302705 31765J 

304037 

304039 



AI418246 
AI419692 
AI421641 
AI972447 
AI986221 
AW072851 
AW074330 
AW090537 
AW137700 
AW151933 



AW292760 
AW296076 
AW297444 



304787 
304921 

327819 c_5_hs 



AA614308 
AA968967 

AW780192AA015718W02571 
N63882 T91174 
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TABLE 14B shows the genomic positioning for those primekeys lacking unigene ID's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey: Unique number corresponding to an Eos probeset 

Reft Sequence source. The 7 digit numbers in this column are Genbank identifier (Gl) numbers 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


332807 


Dunham, l.etal. 


Plus 


297686-297808 


332808 


Dunham, l.etal. 


Plus 


298277-298360 


332812 


Dunham, 1. etal. 


Plus 


309688-310561 


332901 


Dunham, 1. et.al. 


Plus 


1841954-1842090 


333149 


Dunham, 1. et.al. 


Plus 


3574317-3574413 


333916 


Dunham, 1. et.al. 


Plus 


8298994-8299169 


334026 


Dunham, 1. etal. 


Plus 


9196549-9196681 


334061 


Dunham, 1. etal. 


Plus 


9686941-9687077 


334073 


Dunham, 1. etal. 


Plus 


9792201-9792374 


334150 


Dunham, 1. etal. 


Plus 


10529221-10529854 


334379 


Dunham, 1. etal. 


Plus 


13908356-13908467 


334719 


Dunham, 1. etal. 


Plus 


15778859-15779026 


334773 


Dunham, l.etal. 


Plus 


16235169-16235328 


334893 


Dunham, l.etal. 


Plus 


19302753-19302881 


334935 


Dunham, 1. etal. 


Pius 


20108247-20108373 


335146 


Dunham, 1. etal. 


Plus 


21491292-21491457 


335320 


Dunham, 1. etal. 




22542132-22542246 


335568 


Dunham, 1. etal. 


Plus 


24935021-24935655 


335586 


Dunham, 1. etal. 


Plus 


24990333-24990497 


335601 


Dunham, l.etal. 


Plus 


25044923-25045157 


336036 


Dunham, l.etal. 


Plus 


29019796-29019877 


336123 


Dunham, l.etal. 


Plus 


30051089-30051186 


336268 


Dunham, l.etal. 




31997555-31998040 


337173 


Dunham, l.etal. 


Plus 


23624127-23624224 


337460 


Dunham, l.etal. 


Plus 


32536159-32536395 


337685 


Dunham, l.etal. 


Plus 


3547161-3547245 


337736 


Dunham, 1. etal. 




3850500-3850643 


337780 


Dunham, l.etal. 


Plus 


4113793-4113990 


337965 


Dunham, l.etal. 


Plus 


7034267-7034392 


337976 


Dunham, l.etal. 


Plus 


7166011-7166119 


338030 


Dunham, l.etal. 


Plus 


8072708-8072827 


338112 


Dunham,l. etal. 


Plus 


10391398-10391600 


338165 


Dunham, l.etal. 


Plus 


12205719-12205875 


338178 


Dunham, 1. etal. 


Plus 


12800037-12800181 


338427 


Dunham, 1. etal. 


Plus 


19685043-19685354 


338506 


Dunham, l.etal. 


Plus 


21221871-21221953 


338794 


Dunham, 1. etal. 


Plus 


27114697-27114763 


338910 


Dunham, 1. etal. 


Plus 


28795375-28795551 


339047 


Dunham, 1. etal. 


Plus 


30760793-30760968 


332864 


Dunham, 1. etal. 




1390386-1390296 


332933 


Dunham, 1. etal. 




2035790-2035681 


333193 


Dunham, l.etal. 


Minus 


3832993-3832494 


333712 


Dunham, l.etal. 


Minus 


7286177-7286073 


333940 


Dunham, l.etal. 


Minus 


8523830-8523571 


333942 


Dunham, l.etal. 


Minus 


8552629-8552330 


334287 


Dunham, l.etal. 


Minus 


13294116-13293871 


334387 


Dunham, 1. etal. 




13946021-13945781 


334487 


Dunham, 1. etal. 


Minus 


14432191-14432132 


334913 


Dunham, 1. etal. 


Minus 


19463909-19463815 


335109 


Dunham, l.etal. 


Minus 


21325792-21325667 


335250 


Dunham, 1. etal. 


Minus 


21952922-21952826 
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335288 Dunham, I. etal. 
335290 Dunham, I. etal. 
335549 Dunham, I. etal. 
335862 Dunham, I. etal. 
335864 Dunham, I. etal. 
335905 Dunham, I. etal. 
336205 Dunham, I. etal. 
336276 Dunham, I. etal. 
336433 Dunham, I. etal. 
336605 Dunham, I. etal. 
336616 Dunham, I. etal. 
336679 Dunham, I. etal. 
337043 Dunham, I. etal. 
337272 Dunham, I. etal. 
337357 Dunham, I. etal. 
337393 Dunham, I. etal. 
337497 Dunham, I. etal. 
337646 Dunham, I. etal. 
337920 Dunham, I. etal. 
338083 Dunham, I. etal. 
338220 Dunham, I. etal. 
338752 Dunham, I. etal. 
338763 Dunham, I. etal. 
338983 Dunham, I. etal. 
339209 Dunham, I. etal. 
325240 5866848 
329532 3983505 
329522 3983507 
329519 3983510 
329511 3983514 
325326 5865875 
325303 586690B 
325389 5866921 
325417 5866925 
325450 5866941 
325452 5866941 
325498 5866967 
325587 6682462 
325602 5866994 
325701 5867028 
325780 6381953 
329722 6065785 
329728 6065785 
329566 6272129 
329815 6624888 
329B41 6672062 
325824 5867048 
325866 5867076 
325902 5867101 
325958 5867142 
326014 5867160 
329941 6165199 
330002 6623963 
326154 5867170 
326023 5867245 
326278 5867269 
330036 6042048 
326547 5867307 
326495 5867423 

326507 5867435 

326505 5867435 

326506 5867435 
326530 5867441 

326508 6682496 
330120 66718B4 
330123 6671869 
326858 6552462 
326983 5867657 
327014 5867664 



Minus 22304275-22303770 

Minus 22309950-22309891 

Minus 24666203-24666128 

Minus 26690300-26690125 

Minus 26694537-26694382 

Minus 26988888-26988719 

Minus 30477456-30477311 

Minus 32093320-32093181 

Minus 34067540-34067425 

Minus 15616509-15616358 

Minus 26021027-26020848 

Minus 2035790-2035681 

Minus 17407330-17407251 

Minus 28241476-28241307 

Minus 30906179-30906109 

Minus 31471747-31471569 

Minus 33371317-33371258 

Minus 2648689-2648632 

Minus 6051648-6051510 

Minus 9318438-9318301 

Minus 14166440-14166104 

Minus 26421374-26421135 

Minus 26628148-26628009 

Minus 29908865-29908702 

Minus 32492953-32492593 

Minus 32301-32650 

Plus 42937-43014 

Minus 35265-35458 

Plus 18407-18597 

Plus 20965-21325 

Plus 47726-48024 

Minus 73556-73630 

PIUS 239672-239759 

Minus 110635-110745 

Minus 435379-435552 

Minus 704103-704202 

Plus 173372-173930 

Plus 126724-126967 

Plus 79122-79251 

Minus 72936-73046 

Plus 63634-63873 

Minus 112713-112992 

Minus 207544-207741 

PIUS 98307-98446 

Minus 68431-68720 

Minus 40181-40331 

Minus 42450-42833 

Minus 94358-94628 

Minus 127729-127842 

Plus 53437-53550 

Minus 10358-10447 

Minus 34319-34411 

Plus 46097-46158 

Minus 7103-7179 

PIUS 171799-171896 

PIUS 75250-75903 

Plus 117120-117216 

Minus 623677-623870 

Plus 11843-11930 

Minus 13038-13111 

Minus 8918-8949 

Minus 9368-9509 

Minus 303000-303122 

Plus 78904-79112 

Minus 127553-127656 

Minus 35311-35406 

Minus 69337-69670 

Minus 16023-16581 

Plus 1017630-1017788 
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326930 6456782 

326920 6456782 

327058 6531965 

327061 6531965 

327075 6531965 

327120 6531970 

330126 6093735 

327157 S866841 

327183 5867442 

327192 5867445 

327288 5867481 

327469 5867772 

327489 6004459 

327526 6381882 

327574 5867818 

327665 5867839 

327752 5867949 

327819 5867968 

327796 5867982 

330260 6671884 

330282 6671910 

328078 5868008 

328121 5868031 

328190 5868077 

328227 5868105 

327871 5868131 

328018 5902482 

328624 5868246 

328744 5868290 

328799 5868316 

328291 5868363 

328329 5868375 

32B369 5868388 

326385 5868395 

328397 5868397 

328412 5B68405 

328538 5668485 

328656 6004473 

328638 6004473 

32B903 5868514 

328960 6456775 

330320 5932415 

328993 5868536 

329081 5868602 

329089 5868614 

329109 5B68626 

329192 5B68716 

329218 5868726 

329224 5B68728 

329246 5868732 

329415 5868874 

329454 5868887 



PIUS 606950-607705 

Minus 42425-42519 

Plus 2384268-2384835 

Minus 3486389-3486673 

Plus 4041318-4041431 

Minus 6-108B 

Plus 82458-82623 

Minus 4408-4746 

Plus 84317-84531 

Minus 194652-194764 

Plus 48583-43773 

Plus 145549-145708 

Minus 57796-58015 

Minus 97010-97123 

PIUS 68767-69126 

Plus 141736-141900 

PIUS 93721-94421 

Minus 92202-92717 

Plus 85267-85405 

Plus 45203-45269 

Plus 3982-4114 

Plus 72807-72865 

Plus 153782-153850 

Plus 21082-21165 

Minus 21082-21242 

Minus 88889-89221 

Minus 542547-543133 

Minus 120666-120836 

Plus 138639-138722 

Minus 80771-80923 

Minus 144244-144434 

Plus 191709-192239 

Plus 75371-75583 

Plus 369952-370155 

Plus 344967-345063 

Plus 86427-86519 

Plus 3814-4243 

Plus 792616-792729 

Plus 294618-294903 

Plus 23625-24468 

Plus 38547-38837 

Minus 54458-54697 

Plus 49160-50084 

Plus 93368-93510 

Plus 25805-26923 

Plus 102168-102273 

Plus 166936-167020 

Minus 71408-71707 

Plus 27422-27654 

Minus 250541-250792 

Plus 1011438-1011818 

Plus 51342-51593 
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TABLE 15: 169 GENES WITH SEQUENCE INFORMATION DEPICTED IN TABLE 16 

Table 15 depicts UnigenelD, UnigeneTitle, Primekey, Predicted Cellular Localization, and 
Exemplar Accession for all of the sequences in Table 16. The information in Table 15 is 
linked by EosCode to Table 16. 



ExAccn: 

UnigenelD: 

UnigeneTitle: 

EosCode: 

Localization: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Unigene gene title 

Internal Eos name 

Predicted cellular localization of gene product 



Pkey ExAccn UnigenelD UnigeneTitle 



102393 U42359 

102522 U53347 

102669 U71207 

103119 X63629 

103709 M037316 

104080 AA402971 

104144 AA447439 

104691 AA011176 

105370 AA236476 

106149 AA4248B1 



Hs.183556 
Hs.29279 
Hs.2877 
Hs. 13604 
Hs.57771 



Hs.37744 
Hs.22791 
Hs.256301 



109014 AA156790 

109112 AA169379 

109890 H04649 

110151 H18836 

112971 T17185 

113021 T23855 

114908 AA236545 

114965 AA250737 



116416 AA609219 

117698 N41002 

117984 N51919 

118935 N94303 

119018 N95796 

119126 R45175 



121710 AA419011 
121913 AA428062 
122041 AA431407 
122593 AA453310 
123209 AA489711 
124526 N62096 
126399 AA128075 
126645 AI167942 
126966 R38438 
127537 AA569531 
128790 AA291725 
129109 AA491295 
129184 W26769 
129389 AA621604 



Hs.129836 
Hs.54973 
Hs.72472 



Hs.278695 
Hs.117183 
Hs.97594 



Hs.98732 
Hs.128749 
Hs.203270 
Hs.293185 



Hs.105700 
Hs.108708 
Hs.109201 



CD38 antigen (p45) PBC1 
KIAA0268 protein PAB7 
protein kinase C, iota OAA1 
selectin E (endothelial adhesion molecul ACC5 
RAB3B, msmbar RAS oncogene family PFJ2 
midkine (naurite growth-promoting factor LBH9 
gb:Human N33 protein form 1 (N33) gene, PDG3 
solute carrier family 1 (neutral amino a PFJ4 
eyes absent (Drosophila) homolog 2 LEM9 
cadherin 3, type 1 , P-cadherin (placenta LBG2 
hypothetical protein dJ462023.2 PD06 
kallikrein11 PBA6 
hypothetical protein FU13590 PDM3 
Homo sapiens beta-1 adrenergic receptor PAV1 
transmembrane protein with EGF-like and PDM9 
hypothetical protein MGC13170 PD08 
ESTs PAA4 
K1AA1344 protein PAA3 
DKFZP586E1621 protein PDG8 
ESTs PBF1 
ESTs, Weakly similar to Z223.HUMAN ZINC 
hypothetical protein FU13782 BCU4 
Homo sapiens cDNA FU11245 fe, clone PL 
hypothetical protein FLJ20041 PAV9 

PD03 
PFJ6 
BCY2 

hypothetical protein MGC2648 PDV3 
ESTs OAB6 
ESTs PDT9 
ATPase, Ca++ transporting, type 2C, memb 
ESTs, Weakly similar to I54374 gene NF2 PDM8 
Homo sapiens prostein mRNA, complete cds 
ESTs PBF8 
KIAA1210 protein PDG5 
prostate androgen-rsgulated transcript 1 PDV5 
ESTs; protease inhibitor 1 5 (PI15) BCU7 
Homo sapiens Chromosome 16 BAC clone CIT 
alpha-melhylacyl-CoA racemase PD01 
ESTs, Weakly similar to ALU1.HUMAN ALU S 
ESTs, Weakly similar to JC7328 amino aci PAV4 
transmembrane, prostate androgen induced 
six transmembrane epithelial antigen of PAA5 
solute carrier family 1 5 (H+/peptide tra PD05 
ESTs PAA6 
secreted frizzled-relatad protein 4 BCX2 
calcium/calmodulin-dependent protein kin PFJ7 
CGI-86 protein PAV6 
spondin 2, extracellular matrix protein CJA5 
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plasma membrane 

plasma membrane 
not determined 



PDG7 
rot del 
PD34 



PAJ5 not determined 
-PAB2plasma membrane 

vesicular 

PAZ1 not determined 

PAA2 plasma membrane 
plasma i ' 
PDY4 
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4 AA172056 

4 R73640 Hs.11260 
0 AA128997 Hs.18953 

5 AA219134 Hs.26691 



133944 
134110 
301B05 



310431 
310573 
310598 

25 310816 
311593 
313676 
314121 
314691 

30 314785 
314907 
315051 
315052 
316442 

35 317548 
317869 



321441 
322303 
45 322782 



324617 
324626 
324658 
55 324718 



3 U42360 

3 X74331 

4 U07919 
4 U07919 



Hs.71119 
Hs.74519 
Hs.75746 
Hs.75746 
AA045870 Hs.7780 
U41060 Hs.79136 
AI800004 Hs.142846 
AI869566 Hs.123119 
AA508353 Hs.105314 
AA340605 Hs.105887 
D30891 Hs.19525 
AW503733 Hs.9414 
AI460004 Hs.31608 
A1734009 Hs.127699 
A1420227 Hs.149358 
AW292180 Hs.156142 
AI338013 Hs.1 40546 
AI973051 Hs.224965 
AI682088 Hs.79375 
AA861697 Hs.120591 
AI732100 Hs.187619 
AW207206 Hs.136319 
AI538226 Hs.32976 
AI672225 Hs.222886 



331099 
331490 
331889 



AA876910 Hs.134427 
AA760B94 Hs.1 53023 
AI654187 Hs.195704 
AW295184 Hs.129142 
AW291511 Hs.159066 
AF071538 
AM60775 Hs.6295 
AF071202 Hs.1 39336 
NM_006953Hs.159330 
AF038966 Hs.31218 
AW297633 Hs.1 18498 
W07459 Hs.1 57601 
AA056060 Hs.202577 
AW043782 Hs.293616 
AF055019 Hs.21906 
AA639902 Hs.104215 
AI146686 Hs.143691 
AA464018 Hs.1 84598 
AW016378 Hs.292934 
AA508552 Hs.1 95839 
AI685464 

AI694767 Hs.129179 
AI557019 Hs.1 16467 

U31382 Hs.299867 

AA449677 Hs.1 5251 

T48536 Hs.1 22764 

M149579 Hs.91202 

H36671 Hs.14846 

N32912 Hs.291039 

AA431407 Hs.98802 
N58172 



65 332396 AA340504 
332697 T94885 



ESTs PAB4 
hypothetical protein FU1 1264 PAJ3 
phosphodiesterase 9A PEE6 
ESTs PBA7 
ESTs PAA7 
six transmembrane epithelial antigen of PM17 
homeoboxB13 PFJ5 
Putative prostate cancer tumor suppresso PDM1 
primase, polypeptide 2A(58kD) PDM2 
aldehyde dehydrogenase 1 family, member 
aldehyde dehydrogenase 1 family, member 
Homo sapiens mRNA; cDNA DKFZp564A072 (fr 
LIV-1 protein, estrogen regulated BCR4 
hypothetical protein PEU4 
MAD (mothers against decapentaplegic, DrPBJB 
retaxin1(H1) PBH3 
ESTs, Weakly similar to Homoiog of rat Z PE64 
hypothetical protein FU22794 PBM4 
KIAA1488 protein PBY3 
hypothetical protein FU20041 PEU5 
KIAA1603 protein PCQ8 
ESTs, Weakly similar to A46010 X-linked PBH1 
ESTs PEN3 
ESTs PCW3 
ESTs PET5 
hobcarboxyiase synthetase (biotin-[prop PBH8 
ESTs PBY2 
ESTs PBY1 
ESTs BFF8 
guanine nucleotide binding protein 4 CB07 
ESTs, Weakly similar to TRHY.HUMAN TRICH 
ESTs PBM9 
ESTs PBJ7 
ESTs PBJ9 
ESTs PBQ6 
deoxyribonuclease II beta PBQ7 
hypothetical protein FU10188 PBJ1 
prostate epithelium-specific Ets transcr PEN1 
ESTs, Weakly similar to T1 7248 hypothBfj PE07 
ATP-binding cassette, sub-family C (CFTR PBH5 
uroplakin 3 PEL9 
secretory carrier membrane protein 1 PBY4 
Homo sapiens LUCA-1 5 protein mRNA, splic 
ESTs CBF9 
Homo sapiens cDNA FU12166 fts. clone MA 
ESTs PCQ7 
Homo sapiens clone 24670 mRNA sequence 
ESTs, Moderately similar to SPCNJHUMAN S 
ESTs PBQ9 
Homo sapiens cDNA: FU23241 fls, clone C 
ESTs PBM3 
ESTs, Weakly similar to I38022 hypotheti PBH4 
gb:tt88f04.x1 NCI_CGAP_Pr28 Homo sapiens 
Homo sapiens cDNA FU13581 lis, clone PL 
small nuclear protein PRAC CBK1 
PBJ2 

guanine nucleotide binding protein 4 PEW-1 
hypothetical protein PBM1 
TMPRSS2, transmembrane protease, serine 
ESTs PBQ4 
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 
ESTs PCI4 
ESTs, Moderately similar to T14342 NSD1 PBH7 
gb:za21 f09.s1 Soares fetal liver spleen PBQ5 
gb:hw31a09j(1 NCI_CGAP_KidU Homosapien 
PBQ8 
. PBH2 
PBY9 
PBY7 



plasma membrane 
nuclear 

plasma membrane 




not determined 
PBY8 not determined 
secreted 

PBQ1 not determined 



PBY6 not determined 



cytoplasmic 
•PCW6 
PBJ4 plasma 

not determined 
cytoplasmic 
not determined 
PEL3 plasma i 
plasma membrane 
PCQ1 cytoplasmic 
nuclear 



PBJB not determined 
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2 H20276 Hs.31742 

0 S79876 Hs.44926 
6 AF216077 Hs.48376 
2 AK000631 Hs.52256 

1 NM_0Q5982Hs.54416 
6 U80034 Hs.68583 
5 BE244589 Hs.75207 



AA1 47224 Hs.337232 
AA948033 Hs.130853 
X57010 Hs.81343 
AA279490 Hs.86368 
AI820961 Hs.193465 
NM_004996Hs.89433 
U24577 Hs.93304 
AW161450 Hs.109201 
NM_001141Hs.111256 
AW102723 Hs.75295 
NM_013989Hs.154424 



418882 
419839 
421887 



425710 
427958 
428819 



431217 
431716 
431992 



446320 AF126245 Hs.14791 

447210 AF035269 

449156 AF103907 Hs.171353 

449625 NMJJ14253 

449650 AF055575 Hs.23838 

451939 U80456 Hs.27311 

451982 F13036 Hs.27373 



AA418000 Hs.98280 
AL135623 Hs.193914 
AA460421 Hs.30875 
AW873986 Hs.1 19383 
BE245562 Hs.2551 
NM_013427Hs.250830 
D89053 HS268012 
NM_002742Hs,2891 
AA527941 

AI669973 Hs.200574 
W07088 Hs.293685 
AA650114 Hs.325198 
AI446444 Hs.1 90394 
AI972867 Hs.7130 
AA909358 Hs.128612 



ESTs PEW7 
dipeptidylpaptidase IV (CD26, adenosine PEZ3 
Homo sapiens clone HB-2 mRNA sequence 
hypothetical protein FU20624 PFG1 
sine oculis homeobox (Drasophila) homolo PEW3 
mitochondrial intermediate peptidase PEZ9 
glyoxalase | PFJ3 
ESTs OBH6 
HomeoboxA13 PFC6 
ESTs PEZ5 
"collagen, type II, alpha 1 (primary ost PFJ1 
calmegin PFA1 
ESTs PEY4 
ATP-binding cassette, sub-family C (CFTR OBH2 
"phospholipase A2, group VII (platelet-a PFH9 
CGI-86 protein PFH2 
"arachidonate 15-lipoxygenase, second ty PFH5 
guanylate cyclase 1 , soluble, alpha 3 PFA3 
"deiodinase.iodothyronlne.typell" PFH6 
solute carrier family, member 4 PFD4 
potassium intermediate/small conductance PFH1 
KIM0575 gene product PFD6 
ESTs PE27 
ESTs PEY5 
adrenergic, beta-2-, receptor, surface PEZ4 
Rho GTPase activating protein 6 PFG6 
fatty-acid-Coenzyme A ligase, long-chain PEZ1 
protein kinase C, mu PFH4 
gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 
ESTs PEW8 
ESTs PFG3 
ESTs PEY3 
ESTs, Weakly similar to B28096 line-1 pr PEW5 
copinelV PEWS 
ESTs PFC8 
cortactin SH3 domain-binding protein PEZ6 
"acyl-Coenzyme A dehydrogenase family, m 
phosphatidylserine-specific phospholipas PFH8 
prostate cancer antigen 3, non-coding DD PEZ6 
odz (odd Oz/ten-m, Drasophila) homobg 1 PEZ2 
calcium channel, voltage-dependent, L ty PFD2 
single-minded (Drosophila) homolog 2 PFJ8 
Homo sapiens mRNA; cDNA DKFZp56401763 (f 
ESTs PFD8 
ISL1 transcription factor, LlfvVhomeodoma PFG4 
hypothetical protein FU21062 PFC5 
EphA5 PFH3 
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TABLE 15A shows the accession numbers for those primekeys lacking a unigenelD in Table 
15. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
5 Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 

Accession: Genbank accession numbers 



Pkey CAT number Accession 



25 101485 18113J 



AI972402 AI634409 AI523716 AI799749 W44518 A1424438 AI688513 AI971048 AI686324 AW013854 AA588483 AA52811 1 AI627428 
AI582200 AI669296 AI826926 AI620526 AI669958 AI9724S8 AI924500 AA512903 W44517 M335363 AW238997 BE30O1 65 
BE250665 AA2841 95 AA523420 W52834 AI471970 AI9S2824 AW003820 AW009463 AA669796 AA1 14966 AI6S3342 AA1 15038 
AI342150 A1092100 AI968211 W51994 AI804005 AI201420 AI123210 AI738405 AI674964 AI970341 AW027500 AI493316 AI333193 
AI139353 AA599463 AI656163 AI804200 AI365321 AI990213 AI657011 AA650025 AI968810 AI341978 AA599839 AW592602 
AA644289 AI468578 AI565265 AI565228 BE221 535 AW973052 

AA296520 AL021940 M30640 NM_000450 M24736 M61894 AL047443 H39560 AI694691 AA916787 AI214796 AA939085 AI150616 
AA412553 AA412545 AI051015 T27654 AA694430 

AA088767 AF224278 AA128075 AL035541 AA027926 AI761441 AI972Q96 AW071693 AI742327 AI377498 AI804815 AI640802 
AI885001 AI921394 AA5951 15 N71820 AI921217 AW007283 AI467828 AI369306 AA917446 AI493698 AA088701 AA126899 A1936228 
AW204238 AI039567 AI925027 BE138909 AW452945 AW135998 AA310984 AA027850 AW073519 AI537597 AA953976 AI521341 
AW273569 AW050740 AA536113 AA559064 AI474392 AW135709 AA535181 AW572959 AA570597 AI905464 AI677810 AI587B42 
AW975102AA424310AA482527 N64192AA658276AW889117AA486591 AW889172AI381990AI381991 AI673419 AI990950 



AW369S62 AA512956 AA541675 AA451748 AI250993 BE146418 AA122025 

AI362575 AI805082 AW263421 AI432462 AA135870 AA031360 AA031604 AA298475 AA29B464 

NM.012445 AB027466 BE407510 BE047605 AA047125 AW084003 AA149494 AA149490 AA292528 AA570505 AA526186 AW00G250 
AW007762 AI341557 AI799666 AI972710 AI377956 A1962810 AIQ84783 AI458032 AI190971 AW148913 AA372354 AW970032 
AW007426 AA850188 AI123203 AI122890 AI280975 W73595 W73495 AI863238 AA374109 AA603986 AW149089 AW957523 
AI30774S AI921067 AI336463 F24537 A138045O AI36750O AI1 89309 A1814701 AI76S921 AW572106 AA037024 AW072576 AA578293 
AI288103 AA235464 AW450642 AA574230 AW294024 AI589229 AI5B0733 AW512227 AA877009 AI660255 AW1 88597 AA558228 
AI572782 AA658397 AI274628 AI866359 AA8B4573 AI264439 AA621604 AW515493 AW243333 Z39737 AI567038 AA573997 
AA573559 AW236431 AI652870 AI684973 AA034505 AA047126 

AI267700 AI720344 AA191424 AI023543 AI469633 AA172058 AW958465 AA172236 AW953397 AA355086 
AL080235 AA031760 D81382 AI480231 AI095947 AI560953 BE010721 AI870290 AA374945 AA125792 D51527 D51556 AI685541 
D51559 AW1 17286 AA195741 AI675138 AW593439 A1201885 T30590 AW952100 D51095 AA523864 W70043 AA98758S AI421515 
AI205532 AA127069 AI337367 D51 595 AI453785 AW075677 AW088359 C1 4287 C14284 

AF163474 NM_016590 AF163475 AI761105 AI77009B AA410580 AA411616 AI590343 AI739050 AL050198 AI862645 AA419104 
AA513809 AA333032 AI816915 AW139625 AA640889 AI311391 AI627693 AW135514 AA41901 1 AI269149 AI245259 AI970008 
AI970017 AW139445 AA569503 AI761072 AI766179 AI759995 AI30O776 AI870129 AW150770 AA22B501 AA226220 . 
AI249368 AI742316 AA428062 AA442089 AI864189 BE349478 AI803475 AI584049 BE552085 AI088609 AI264197 AI886144 AI129474 
AI307145 BE181300 AW058403 AI696838 AW748598 AA442196 AI216428 



315051 347217J AW292425 BE467167 AI702953 BE550961 BE222309 AI299348 AI693336 AA541708 
324626 336411.1 AI685464 AW971336AA513587 AA525142 

319191 16065J NM.012391 AF071538 AB031549 AI685592 AI745526 AA662204 AW130657 AA662164 AW971 121 AI568916 AA513274 AI991223 
AI979170 AW298436AA639821 AI859010 AW513942 AI687669AA662521 AA548598 AI345056 AI305374 BE043418 AI432856 
AI334840 AI379796 AI492693 AI307915 BE042082 AI307834 AI307858 AI309488 BE042210 AI435670 AI371605 AI862491 AI2B4563 
AI306872 AI255044 AI254601 AI251236 AI473073 AI473042 AI432760 AI435664 AI336826 AI289365 AI369096 AI852274 AI334871 
AI349863 AI250405 AI377617 AI309895 AI313017 AI862291 AI31 1936 AI378718 AI305722 AI306769 AI308888 AI334565 AI862296 
AI344230 AI435685 AI344037 AI378696 AI311209 AI435775 AI310611 AI311154 AI432289 AI431561 AI492681 AI432867 AI335288 
AI492796 AI432769 AI310299 AI432273 AI379820 AI275319 AI435753 AI609441 AI432767 AI369100 AI31 1420 AI349974 AI247157 
AI334677 AI270910 AI224320 AI305608 AI334489 AI377152 AI350012 AI370086 AI335053 AI306781 AI306750 AI334849 AI334874 
AI340380 AI307876 AI305974 AI305972 AI311521 AI334872 AI862509 AI311498 AI335051 AI289684 AI310859 AI311862 AI862483 
AI492775 AI307906 AI492708 AI289693 AI340373 AI30791 0 AI31 1359 AI435653 AI334865 AI31 1492 AI492809 AI492690 AI431576 
AI862268 AI311879 AI308435 AI492792 AI862512 AI275321 AI431588 AI431564 AI307885 AI307926 AI435692 AI435778 AI310182 
AI308894 AI492707 AI492713 AI308560 AI307829 AI343234 AI580598 AW472796 AI340918 AI310243 AI309368 AI307920 AI289665 
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AI306777 AW08631 8 AW086292 AW086378 AI310027 AI275293 AI369082 AI340900 AI306749 AI371558 AW086287 BE043803 
AI306793 AI306272 AI287948 AI270917 AI284816 AI336813 AI284546 AI308044 AI275290 AI270872 AI306795 AI289687 AI223570 
AI305303 AI289B77 AI287742 AI275284 AI306812 AI336701 AI371554 AI378719 AI344988 AI223631 AI335141 AI343222 AI284568 
AI305357 AI27S270 AI34S932 AI436549 AI307925 AI31 1502 AI344238 AI343182 AI308508 AI30S988 A1270790 AI379792 AI305547 
AI305410 AI432251 AI436517 AI343227 AI30S534 AI340387 AI271043 AI305499 AI271046 AI305962 AI289465 AI305378 AI289725 
AI31 0848 AI305848 AI289362 AI252964 AI307049 AI310831 AI306993 AI306796 AI224659 AI305969 AI349855 AI306164 AI3Q6948 
AI284676 AI309155 AI343202 AI432785 AI30S815 AI369081 AI270885 AI289699 AI435704 AI309647 AI305716 AI31 1281 AI287927 
AI472995 AI340423 AI270958 AI307069 AI305364 AI270807 AI275306 AI311890AI275263 AI432750AK89371 AI432861 AI25S113 
AI305709 AI473008 AI31 1 168 AI30971 1 AI377164 AI271201 AI289560 AI309710 AI306195 AI31 1201 AI287741 AI271066 AI432876 
AI275281 AI379795 AI472972 AI31 1967 AI308826 AI305465 AI270792 AI473019 AI305340 AI270922 AI305995 AI305462 AI254144 
AI270969 AI473012 AI305390 AI275278 AI223644 AI289692 AI250318 AI305372 AI289691 AI2S0521 AI306283 AI306B14 AI307933 
AI473160 AI432903 AI223720 AI254979 AI334862 AI306926 AI289541 AI432248 AI435722 AI435698 AI432859 AI310683 A147317S 
AI335144 AI289467 AI436489 AI3Q6928 AI473033 AI305763 AI307868 AI307882 AI348959 A1435736 AI432857 AI432896 A1435735 
AI432283 AI473086 AI432863 AI4730B1 AI432825 AI307840 AI473164 AI432885 AI473166 AI472982 AI435734 AI473Q60 AI473171 
AI432279 AI432882 AI334670 AI436512 AI432827 AI432852 AI473051 AI473077 AI435697 AI271509 AI492781 AI472983 AI473018 
AI432897 AI473Q43 AI432871 AI436536 AI473157 AI349715 AI432777 AI473016 AI4731S8 AI340369 AI307941 AI432773 AI377146 
AI492791 AI270950 AI305342 AI284604 AI305259 AI28481 1 AI27081 1 AI289347 AI334869 AI334852 AI31 1759 AI250382 AI309520 
AI2B9550 AI305721 AI34O870 AI2709D1 AI303575 AI307904 AI340715 AI270941 AI309808 AI246867 AI473014 AI307039 AI289360 
AI473069 AI492786 AI344013 AI305876 AI436510 AI340742 AI473028 AI307891 BE041871 BE041268 BE042340 BE041946 
BE041783 AI306173 AI201948 AI926972 AI275769 

338255 CH22_6856FG_UNK_EM:ACO0 

330211 c_5j>2 

332798 CH22J4FG 6_5_LINK C4G1.G 
334447 CH22_1746FG_387_7_LINK_EM 

332247 372969J AA669097 AA513815 AA026798 AA676526 AA704429 AA704269 AW1 18292 AA57921S N58172 

332396 20265J AW579842 BE156562 BE156690 BE1S6489 BE081033 AK001559 BE149402 M85387 AW367811 AW367798 R17370 AI908947 

AA382932 R58449 H18732 AA371231 AW962899 AA7f 3530 AW892946 R53463 H11063 AW068542 Z40761 BE176212 BE176155 
W23952 W92188 AW374883 AA303497 AW954769 AA036808 BE168063 AW382073 AW382085 AL041475 H80748 AI078161 
BE463983 AI805213 AI761264 W94885 N94502 AI623772 AI419532 AI810302 Al 634190 AW002516AW150777 AI352312 AI367474 
AW204807 AI675502 AI337026 AW134715 BE328451 AI123157 AI560020 AI300745 AI608631 AI248873 AA742484 AW051635 
H18646 AI245045 AA5071 1 1 AI64051 0 AI925594 AA1 15747 AA143035 AA151 106 

332697 13699J X514O5NM_O01873T11322AL118886BE328175AW1380O9BE467445AW470313M774852BBO4139AW501O46AA082792 
AW389231 AA370O44 R36841 AA371457 C04813 R25791 R25556 AW895854 AW903819 AW895671 AW895677 BE159723 
AW895664 AW895597 AW895595 AW895665 AW888518 AI903724 F06081 F08503 AL1 19462 AW895730 AWB8851 6 R2651 1 
R26489 AA334126 AA327626 N85713 AW895998 AA223622 F05468 AA370749 W05590 M78202 AA371073 AW498607 R15017 
T16991 M(X)1282 M001138AA551566 M330159AI922855AA383512AA029603D82246D82171 T94933H56545AA348060 
AA176888 R98764AW451817AA385766 AA452618 AI690057 M988822BE549928 M150901 W57992AW899925 C05281 
AA932042 AA370980 AW962877 W04741 AA369982 AW385948 AA922466 N75882 AW22070 AI361256 AI680224 D57122 T94885 
R53266 R46713T19071AW796277AA325333FM719F()2334 M358146 M626597AA^ 



AW961219 AA453239 AW837541 N45521 BE218029 AA318677 AA327740 AW961809 T92139 D53216 D52365 D53363 D53312 
D531 16 AI547267 AA679935 AW026552 AW026418 AW1 90507 AI927710 AW244108 D50948 AW054991 AW021063 AW02251 1 
AA493436 AI365636 BE464751 AW149384 AA102442 AW771368 AI818251 AI126368 D51049 AI421542 AI559467 AW079779 
AW021O48 AW023969 AW044214 AI458264 AA027274 AI620254 AW026917 BE219511 AA326242 N67561 AI971273 AA878328 

45 D57131 AA770662 AI309299 AI796767 AA613338 W58076 A1566287 AI445573 AI880260 AA001919 AW339259 AI492610 AI492611 

R97692 AI3Q1425 AA722603 D58361 AI350323 AA973926 AI431263 AA516126 AA865467 AI925177 N39443 AA001943 AI299371 
AI082412 AA6S5090 AA583433 H89871 M977231 AI362219 AI056096 AI270446 N67524 N22103 AW614224 AA744054 AW243622 
AI613188 AI929173 AI350243 AI362138 AA744004 AA176661 D56787 AI955625 AI393109 AI094769 AI479728 AI423107 AI955617 
AI034036 AI582196 AW264534 AI418961 AA570761 AI343538 M650341 AA992503 AA770004 AL039666 AI862675 AW1 90335 

50 AA610274 AW418627 BE467472 D56786 T28749 AI217610 AI359556 T23523 AL040189 AA846222 AA651636 D512B0 AI888986 

AI521 167 AI340177 AW612815 AI625285 AA621607 M177059 AA229768 AA829788 AI749682 AW190331 N75299 AA230089 
AI915632 BE059542 AA890020 AA52B397 AA995390 BE503860 AA570812 AW339396 AI197986 AIH03725 AI2B2379 AA670375 
AA461513 F01728 AW243599 C00856 N75567 R95995 AA150932A95961 AA648060 AA933800 AA927073 M101126 AA864190 
T93566BE167472 

55 425710 25529J AF030880 NM_000441 ACO02467 AA385554 H23053 AW891838 AI139968 AA653057 AI695233 
432189 342819J AA527941 AI810608 AI620190AA635266 

445424 6391J AB028945 T77648 F13328 AL157605 Z46212 AA304736 F1 1 855 T66098 T30174 AW954164 AW176301 AW748243 AA456428 
AI369958 AA938565 AW959613 Z42003 M994779 AI683909 F11019 F10926 AI769597 AI752550 T65015 AI884314 AA643954 
Z41838 AW020147 AI038822 AW571822 AA299781 AA894928 AF131790 BE00541 1 AI902476 AW082695 AA464384 R42750 
60 AW902301 AA464273 R05837 Z38294 H41098 AL134507 M86079 

447210 71 19J AF035269 AF035268 NM_015900 T96213 U37591 AA156832 AA299371 AI084325 H95977 AI765967 BE221465 AA156726 AI969563 
AW024539 AI436791 AI949451 AA343093 AI452756 AA824232 AI306667 T96131 AW207447 AW243556 AW957032 AI084332 
H95978 U30998 

449625 8113J NM_014253 AF1 00772 BE038769 AL022718 BE161779 AW863569BE161640 AL039060BE1 68542 AW296554 AA323193 AA235370 
65 AW779760 N48674AI375997 R45432 D59344AI203107 F07491 R35360 R25094AI913631 AI498402 T61382 AI016320 N45526 

T61415AA331486 

452039 89513J AI922988 H05475 AA0216O8 AW169947AA913750 Z41614 AW8O0O12 
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TABLE 15B shows the genomic positioning for those primekeys lacking unigene ID's and 
accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
5 listed. 

Pkey: Unique number corresponding to art Eos probeset 

Hef: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the 

publication entitled The DMA 

10 sequence of human chromosome 22." Dunham I. et al. Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



15 Pkey Ref Strand NLposition 

334447 Dunham, I. etal. Plus 14308764-14308824 

332798 Dunham, I. etal. Minus 232147-231974 

338255 Dunham, I. etal. Minus 15242294-15242231 

20 330211 6013592 Plus 59158-59215 

401424 8176894 Plus 24223-24428 



301 



WO 02/30268 



TABLE 11 AND SEQUENCE LISTING 

SEQIDNCM BCU4 DNA SEQUENCE 

Nucleic Acid Accession* NM.024915 

Coding sequence: 13-1690 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATTGGATCAA ACATCTCACA AGAGTCGGAC AATAATAAAA GACTAGTGGC CTTAGTGCCC 60 
ATGCCCAGTG ACCCTCCATT CAATACCCGA AGAGCCTACA CCAGTGAGGA TGAAGCCTGG 120 
AAGTC ATACT TGGAGAATCC CCTGACAGCA GCCACCAAGG CCATGATGAT CATTAATGGT ISO 
GATGAGGACA GTGCTGCTGC CCTCGGCCTG CTCTATGACT ACTACAAGGT TCCTCGAGAC 240 
AAGAGGCTGC TGTCTGTAAG CAAAGCAAGT GACAGCCAAG A AGA CCAGGA GAAAAGAAAC 300 
TGCCTTGGCA CCAGTGAAGC CCAGAGTAAT TTGAGTGGAG GAGAAAACCG AGTGCAAGTC 360 
CTAAAGACTG TTCCAGTGAA CCTTTCCCTA AATCAAGATC ACCTGGAGAA TTCCAAGCGG 420 
GAACAGTACA GCATCAGCTT CCCCGAGAGC TCTGCCATCA TCCCGGTGTC GGGAATCACG 480 
GTGGTGAAAG CTGAAGATTT CACACCAGTT TTCATGGCCC CACCTGTGCA CTATCCCCGG 540 
GGAGATGGGG AAGAGCAACG AGTGGTTATC TTTGAACAGA CTCAGTATGA CGTGCCCTCG 600 
CTGGCCACCC ACAGCGCCTA TCTCAAAG AC GACCAGCGCA GCACTCCGGA CAGCACATAC 660 
AGCGAGAGCT TCAAGG ACGC AGCCACAGAG AAATTTCGGA GTGC1TCAGT TGGGGCTGAG 720 
GAGTACATGT ATGATCAGAC ATCAAGTGGC ACATTTCAGT ACACCCTGGA AGCCACCAAA 780 
TCTCTCCGTC AGAAGCAGGG GGAGGGCCCC ATGACCTACC TCAACAAAGG ACAGTTCTAT 840 
GCCATAACAC TCAGCGAGAC CGGAGACAAC AAATGCTTCC GACACCCCAT CAGCAAAGTC 900 
AGGAGTGTGG TGATGGTGGT CTTCAGTGAA GACAAAAACA GAGATGAACA GCTCAAATAC 960 
TGGAAATACT GGCACTCTCG GCAGCATACG GCGAAGCAGA GGGTCCTTGA CATTGCCGAT 1020 
TACAAGGAGA GCTTTAATAC GATTGGAAAC ATTGAAGAGA TTGCATATAA TGCTGTTTCC 1080 
TTTACCTGGG ACGTGAATGA AGAGGCGAAG ATTTTCATCA CCGTGAATTG CTTGAGCACA 1140 
GATTCCTCCT CCCAAAAAGG GGTGAAAGGA CTTCCTTTGA TGA1TCAGAT TGACACATAC 1200 
AGTTATAACA ATCGTAGCAA TAAACCCATT CATAGAGCTT ATTGCCAGAT CAAGGTCTTC 1260 
TGTGACAAAG GAGCAGAAAG AAAAATCCGA GATGAAGAGC AGAAGCAGAA CAGGAAGAAC 1320 
GGGAAAGGCC AGGCCTCCCA AACICAATGC AACAGCTCCT CTGATGGGAA GTTGGCTGCC 1380 
ATACCTTTAC AGAAGAAGAG TGACATCACC TACTTCAAAA CCATGCCTGA TCTCCACTCA 1440 
CAGCCAGTTC TCTTCATACC TGATGTTCAC TTTGCAAACC TGCAGAGGAC CGGACAGGTG 1500 
TATTACAACA CGGATGATG A ACGAGAAGGT GGCAGTGTCC TTGTTAAACG GATGTTCCGG 1560 
CCCATGGAAG AGGAGTTTGG TCCGGTGCCT TCAAAGCAGA TGAAAGAAGA AGGGACAAAG 1620 
CGAGTGCTCT TGTACGTGAG GAAGGAGACT GACGATGTGT TCGATGCATT GATGTTGAAG 1680 
TCTCCCACAG TGATGGGCCT GATGGAAGCG ATATCTGAGA AATATGGGCT GCCCGTGGAG 1740 
AAGATAGCAA AGCTTTACAA GAAAAGCAAA AAAGGCATCT TGGTGAACAT GGATGACAAC 1800 
ATCATCGAGC ACTACTCGAA CGAGGACACC TTCATCCTCA ACATGGAGAG CATGGTGGAG I860 
GGCTTCAAGG TCACGCTCAT GGAAATCTAQ CCCTGGGTTT GGCATCCGCT TTGGCTGGAG 1920 
CTCTCAGTGC GTTCCTCCCT GAGAGAGACA GAAGCCCCAG CCCCAGAACC TGGAGACCCA 1980 
TCTCCCCCAT CTCACAACTG CTGTTACAAG ACCGTGCTGG GGAGTGGGGC AAGGGACAGG 2040 
CCCCACAGTC GGTGTGCTTG GCCCATCCAC TGGCACCTAC CACGGAGCCG AAGCCTGAGC 2100 
CCCTCAGGAA GGTGCCTTAG GCCTGTTGGA TTCCTATTTA TTGCCCACCT TTTCCTGGAG 2160 
CCCAGGTCCA GGCCCGCCAG GACTCTGCAG GTCACTGCTA GCTCCAGATG AGACCGTCCA 2220 
GCGTTCCCCC TTCAAGAGAA ACACTCATCC CGAACAGCCT AAAAAATTCC CATCCCTTCT 2280 
TTCTCACCCC TCCATATCTA TATCTCCCGA GTGGCTGGAC AAAATGAGCT ACGTCTGGGT 2340 
GCAGTAGTTA TAGGTGGGGC AAGAGGTGGA TGCCCACTTT CTGGTCAGAC ACCTTTAGGT 2400 
TGCTCTGGGG AAGGCTGTCT TGCTAAATAC CTCCAGGGTT CCCAGCAAGT GGCCACCAGG 2460 
CCTTGTACAG G AAGACATTC AGTCACCGTG TAATTAGTAA CACAGAAAGT CTGCCTGTCT 2520 
GCATTGTACA TAGTGTTTAT AATATTGTAA TAATATATTT TACCTGTGGT ATGTGGG CAT 2 580 
GTTTACTGCC ACTGGCCTAG AGGAGACACA GACCTGGAGA CCGTTTTAAT GGGGGTTTTT 2640 
GCCTCTGTGC CTGTTCAAGA GACTTGCAGG GCTAGGTAGA GGGCCTTTGG GATGTTAAGG 2700 
TGACTGCAGC TGATGCCAAG ATGGACTCTG CAATGGGCAT ACCTGGGGGC TCGTTCCCTG 2760 
TCCCCAGAGG AAGCCCCCTC TCCTTCTCCA TGGGCATGAC TCTCCTTCGA GGCCACCACG 2820 
TTTATCTCAC AATGATGTGT TTTGCCTGAC TTTCCCTTTG CGCTGTCTCG TGGGAAAGGT 2880 
CATTCTGTCT GAGACCCCAG CTCCTTCTCC AGCTTIGGCT GCGGGCATGG CCTGAGCTTT 2940 
CTGGAGAGCC TCTGCAGGGG GTTTGCCATC AGGGCCCTGT GGCTGGGTCT GCTGCAGAGC 3000 
TCCTTGGCTA TCAGGAGAAT CCTGGACACT GTACTGTGCC TCCCAGTTTA CAAACACGCC 3060 
CTTCATCTCA AGTGGCCCTT TAAAAGGCCT GCTGCCATGT GAGAGCTGTG AACAGCTCAG 3120 
CTCTGAGTCG GCAGACTGGG GCTTCCTCCT GGGCCACCAG ATGGAAAGGG GGTATTGTTT 3 180 
GCCTCACTCC TGGATGCTGC GTTTTAAGGA AGTGAGTGAG AAAGAATGTG CCAAGATACC 3240 
TGGCTCCTGT GAAACCAGCC TCAGGAGGGA AACTGGGAGA GAGAAGCTGT GGTCTCCTGC 3300 
TACATGCCCT GGGAGCTGGA AGAGAAAAAC ACTCCCCTAA ACAATCGCAA AATGATGAAC 3360 
CATCATGGGC CACTGTTCTC TTTGAGGGGA CAGGTTTAGG GGTTTGCGTT CGCCCTTGTG 3420 
GGCTGAAGCA CTAGCTTTTT GGTAGCTAGA CACATCCTGC ACCCAAAGGT TCTCTACAAA 3480 
GGCCCAGATT TGTTTGTAAA GCACTTTGAC TCTTACCTGG AGGCCCGCTC TCTAAGGGCT 3540 
TCCTGCGCTC CCACCTCATC TGTCCCTGAG ATGCAGAGCA GGATGGAGGG TCTGCTTCTA 3600 
GCTCAGCTGT TTCTCCTTG A GGTTGCGG AG GAATTGAATT GAATGGGACA G AGGGCAGGT 3660 
GCTGTGGCCA AGAAGATCTC CGAGCAGCAG TGACGGGGCA CCTTGCTGTG TGTCCTCTGG 3720 
GCATGTTAAC CCTTCTGTGG GGCCAAAGGT TTGCATCGTG GATCCAGCTG TGCTCCAGTC 3780 
TGTCCCCTCC TCCTCCACTC TG ACTGCCAC GCCCCGGACC AGCAGCTTGG GGACCCTCCA 3840 
GGGTACTAAT GGGGCTCTGT TCTGAGATGG ACAAATTCAG TGTTGGAAAT ACATGTTGTA 3900 
CTATGCACTT CCCATGCTCC TAGGGTTAGG AATAGTTTCA AACATGATTG GCAGACATAA 3960 
CAACGGCAAA TACTCGGACT GGGGCATAGG ACTCCAGAGT AGGAAAAAGA CAAAAGATTT 4020 
GGCAGCCTGA CACAGGCAAC CTACCCCTCT CTCTCCAGCC TCTTTATGAA ACTGTTTGTT 4080 
TGCCAGTCCT GCCCTAAGGC AGAAGATGAA TTGAAGATGC TGTGCATGTT TCCTAAGTCC 4140 
TTGAGCAATC ATGGTGGTGA CAATTGCCAC AAGGGATATG AGGCCAGTGC CACCAGAGGG 4200 
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20 



45 
50 
55 
60 
65 
70 
75 



TGGTGCCAAG TGCCACATCC CTTCCGATCC ATTCCCCTCT GTATCCTCGG AGCACCCCAG 4260 
TTTGCCTTTG ATGTGTCCGC TGTGTATGTT AGCTGAACTT TGATGAGCAA AATTTCCTGA 4320 
GCGAAACACT CCAAAGAGAT AGGAAAACTT GCCGCCTCTT CTTTTTTGTC CCTTAATCAA 4380 
ACTCAAATAA GCTTAAAAAA AATCCATGGA AGATCATGGA CATGTGAAAT GAGCATTTTT 4440 
5 TTCTTTTCTT TTTTTTTTTT nTTTTTAAC AAAGTCTGAA CTGAACAGAA CAAG ACTTTT 4500 
TCCTCATACA TCTCCAAATT GTTTAAACTT ACTTTATGAG TGTTTGnTA GAAGTTCGGA 4560 
CCAACAGAAA AATGCAGTCA GATGTCATCT TGGAATTGGT TTCTAAAAGA GTAAGGCATG 4620 
TCCCTGCCCA GAAACTTAGG AAGCATGAAA TAAATCAAAT GTTTATTTTC CTTCTTATTT 4680 
AAAATCATGC TAATGCAACA GAAATAGAGG GTTTGTGCCA AATGCTATGA ACGGCCCTTT 4740 
1 0 CTTAAAG ACA AGCAAGGGAG ATTGATATAT GTACAATTTG CTCTCATGTT TTT 

15 



MSQESDNNKR LVALVPMPSD PPFNTRRAYT SEDEAWKSYL ENPLTAATKA MMHNGDEDS 60 
AAALGLLYDY YKVPRDKRLL SVSKASDSQE DQEKRNCLGT SEAQSNLSGG ENRVQVLKTV 120 
PVNLSLNQDH LENSKREQYS ISFPESSAII PVSGIT WKA EDFTPVFMAP PVHYPRGDGE 180 
EQRVVJFEQT QYDVPSLATH SAYLKDDQRS TPDSTYSESF KDAATEKFRS ASVGAEEYMY 240 
DQTSSGTFQY TLEATKSLRQ KQGEGPMTYL NKGQFYAITL SETGDNKCFR HPISKVRSVV 300 
MVVFSEDKNR DEQLKYWKYW HSRQHTAKQR VLDIADYKES FNTIGNIEEI AYNAVSFTWD 360 
VNEEAKIFIT VNCLSTDFSS QKGVKGLPLM IQEDTYSYNN RSNKPIHRAY CQIKVFCDKG 420 
AERKIRDEEQ KQNRKNGKGQ ASQTQCNSSS DGKLAA1PLQ KKSDFTYFKT MPDLHSQPVL 480 
FIPDVHFANL QRTGQVYYNT DDEREGGS VL VKRMFRPMEE EFGPVPSKQM KEEGTKRVLL 540 
YVRKETDDVF DALMLKSPTV MGLMEAISEK YGLPVEKIAK LYKKSKKGIL VNMDDNIIEH 600 
YSNEDTFILN MESMVEGFKV TLMEI 



Nucleic Acid Accession*: 



SEQ ID N0:3 BCU7 DNA SEQUENCE VARIANT 1: 



ATGATAGCAA TCTCTGCCGT CAGCAGTGCA CTCCTGTTCT CCCTTCTCTG TGAAGCAAGT 
ACCGTCGTCC TACTCAATTC CACTGACTCA TCCCCGCCAA CCAATAATTT C 
GAAGCAGCTC TGAAAGCACA ATTAGATTCA GCGGATATCC CCAAAGC 
TACATTTCGC AGAATGACAT GATCGCCATT CTTGATTATC ATAATCAAGT T 



GCAGAGGCTT GGGCGGCTAC T 

TTGGGCCAAA ATCTATCTGT ACGCACTGGA AGATATCGCT CTATTCTCCA G 
CCATGGTATG ATGAAGTGAA AGATTATGCT TTTCCATATC CCCAGGATTG C 
TGTCCTATGA GATGTTTTGG TCCCATGTGC ACACATTATA CGCAGATGGT TTGGGCCACT 540 
TCCAATCGGA TAGGATGCGC AATTCATGCT TGCCAAAACA TGAATGTTTG GGGATCTGTG 600 
TGGCGACGTG CAGTTTACTT GGTATGCAAC TATGCCCCAA AGGGCAATTG GATTGGAGAA 660 
GCACCATATA AAGTAGGGGT ACCATGTTCA TCTTGTCCTC CAAGTTATGG GGGATCTTGT 720 
ACTGACAATC TGTGTTTTCC AGGAGTTACG TCAAACTACC TGTACTGGTT TAAATAA 

SEQ ID N0:4 BCU7 DNA SEQUENCE VARIANT 2: 

Nucleic Acid Accession*: AA42S062 

Coding sequence: 1-777 (entire sequence represents open reading frame) 



ATGATAGCAA TCTCTGCCGT CAGCAGTGCA CTCCTGTTCT CCCTTCTCTG TGAAGCAAGT 
ACCGTCGTCC TACTCAATTC CACTGACTCA TCCCCGCCAA CCAATAATTT CACTGATATT 
GAAGCAGCTC TGAAAGCACA ATTAGATTCA GCGGATATCC CCAAAGCCAG GCGGAAGCGC 
TACATTTCGC AGAATGACAT GATCGCCATT CTTGATTATC ATAATCAAGT TCGGGGCAAA 
GTGTTCCCAC CGGCAGCAAA TATGGAATAT ATGGTTTGGG ATGAAAATCT TGCAAAATCG 



TATTCTCCA GTTGGTCAAG 
CCATGGTATG ATGAAGTGAA AGATTATGCT TTTCCATATC CCCAGGATTG CAACCCCAGA 
TGTCCTATGA GATGTTTTGG TCCCATGTGC ACACATTATA CGCAGATGGT TTGGGCCACT 
TCCAATCGGA TAGGATGCGC AATTCATACT TGCCAAAACA TGAATGTTTG GGGATCTGTG 
TGGCGACGTG CAGTTTACTT GGTATGCAAC TATGCCCCAA AGGGCAATTG GATTGGAGAA 
GCACCATATA AAGTAGGGGT ACCATGTTCA TCTTGTCCTC CAAGTTATGG GGGATCTTGT 
ACTGACAATC TGTGTTTTCC AGGAGTTACG TCAAACTACC TGTACTGGTT TAAATAA 

SEQ ID NO:5BCU7 Protein sequence Variant 1: 



WO 02/30268 



YISQNDHIAI LDYHNQVRGK VFPPAANMEY HVWDENLAKS AEAWAATCIW DHGPSYLLRF 

LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CEMRCFGPMC THYTQMVWAT 

SNRIGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 

TDNLCFPGVT SNYLYWFK 

SEQ ID NO:6BCU7 Protein sequence Variant 2: 



1 11 21 31 41 51 

I I I I I I 

MIAISAVSSA LLFSLLCEAS TWLLNSTDS SPPTNNFTDI EAALKAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CEMRCFGPMC THYTQMVWAT 180 
SNRIGCAIHT CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPGVT SNYLYWFK 

SEQ ID N0:7 BCX2 DNA SEQUENCE 

Nucleic Acid Accession *: NMJK3014 

Coding sequence: 238-1278 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GGCGQGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 60 
CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 120 
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCG AAG GGACAGCGAA AGATGAGGGT 180 
GGCAGGAAG A GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCAI2 240 
TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 300 
GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360 
ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 420 
GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480 
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540 
CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600 
AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 660 
ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720 
CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780 
AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 
AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900 
GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960 
TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TC ATCATGTG TTACGAGTGG 1020 
CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080 
AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1 140 
AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 
GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC AlTAAAACTA GGAGTGCCCA GAAGAGAACA 1260 
AACCCGAAAA GAGTGJQAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320 
GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTA AGGCC ATGTGCCCCT TGCCCTAACA 1380 
ACTCACTGCA GTCCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGGTTCA 1440 
GTTTTTCTTT GTAAGCCATC ACAAG CCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500 
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 1560 
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT 1620 
AAATGCCATA TTTCAAACAA AACACGTAAT TTTnTACAG TATGTTTTAT TACCTTTTGA 1680 
TATCTGITGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 1740 
AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 1800 
TnTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 1860 
TGTGTmTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 1920 
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTnTtTG 1980 
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 2040 
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 2100 
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 2160 
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC C AAATTTTGT TTTTGTTCAT 2220 
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 2280 
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 2340 
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 2400 
ATTT ATTTT A AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTITAr A 2460 
AGGCATTCAA TAAATGCACA ACGCCCA A AG GAAATAAAAT CCTATCTAAT CCTACTCTCC 2520 
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 2580 
GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 2640 
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 2700 
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 2760 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 



SEQ ID NO:8 BCX2 Protein sequence: 
Protein Accession*: NP_003005.1 

1 11 21 31 41 51 
I I I I I I 

MFLSILVALC LWLHLALGVR GAFCEAVRIP MCRHMPWNIT RMPNHLHHST QENAILA1EQ 60 
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YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDP1KPCKSV CQRARDDCEP LMKMYNHSWP 120 
ESLACDELPV YDRGVCISPE AIVTDLPEDV KW1DITPDMM VQERPLDVDC KRLSPDRCKC 180 
KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTVVDV KEIFKSSSP1 PRTQVPLITN 240 
SSCQCPH1LP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRS1QWEER IXJEQRRTVQD 300 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 

SEQ ID N0;9 CBK1 DNA SEQUENCE 

Nucleic Acid Accession *: NM.032391 

Coding sequence: 129-302 (underlined sequences correspond to start and slop codons) 



I I I I I I 

GTCCTTCCTC TCCTAGCCTA AGGCGTGCAA ACAGAGCGCC ACTGGGAGGC TGAAACCTTT 
AGGCCGATGC OTGCTTGCAA GGTCAGGCAA GCTGGATTCT GGTCCCCACC TTTGCAGAGA 
GAACAGC GAT G TTGTGCGCC CATTTCTCAG ATCAAGGACC GGCCCATCTT ACTACCTCCA 
AGAGTGCTTT TCTCTCTAAT AAGAAAACAT CTACTTTGAA ACATCTACTG GGCGAGACCA 
GGAGTGATGG CTCAGCCTGT AATTCTGGAA TTTCGGGAGG CCGAGGCAGG AAGATTCCTT 
: AGCCTGGGCA ATGTAGCAAG ACGCTGTCTC TATTTATACA 



ATAAAATTTT TTTAAAAAAG G 



25 SEQ ID N0:1Q CBK1 Protein sequence: 
Protein Accession*: NPJ 15767 



MLCAHFSDQG PAHLTTSKSA FLSNKKTSTL KHLLGETRSD GSACNSGISG GRGRKIP 

SEQIDN0:11 CHA1 DNA SEQUENCE 



AACTGAAGGC GGACAGTCTC CTGCGAAACC AGGCAATGGC GGAGCTGGAG TTTGTTCAGA 
TCATCATCAT CGTGGTGGTG ATGATGGTGA TGGTGGTGGT GATCACGTGC CTGCTGAGCC 
ACTACAAGCT GTCTGCACGG TCCTTCATCA GCCGGCACAG C 
ATGCCCTGTC CTCAGAAGGA TGCCTGTGGC CCTCGGAGAG CACAGTGTCA CJ 

G GGAGCGCTTC CACCGCTTCC AGCCCACCTA TCCGTACCTG CAGCACGAGA 420 

C ACCCACCATC TCGCTGTCAG ACGGGGAGGA GCCCCCACCC TACCAGGGCC 480 

CCTGCACCCT CCAGCTTCGG GACCCCGAGC AGCAGCTGGA ACTGAACCGG GAGTCGGTGC 540 

GCGCACCCCC AAACAGAACC ATCTTCGACA GTGACCTGAT GGATAGTGCC AGGCTGGGCG 600 

GCCCCTGCCC CCCCAGCAGT AACTCGGGCA TCAGCGCCAC GTGCTACGGC AGCGGCGGGC 660 

GCATGGAGGG GCCGCCGCCC ACCTACAGCG AGGTCATCGG CCACTACCCG GGGTCCTCCT 720 

TCCAGCACCA GCAGAGCAGT GGGCCGCCCT CCTTGCTGGA GGGGACCCGG CTCCACCACA 780 

CACACATCGC GCCCCTAGAG AGCGCAGCCA TCTCGAGCAA AGAGAAGGAT AAACAGAAAG 840 

GACACCCTCT CTAGGGTCCC CAGGGGGGCC GGGCTGGGGC TGCGTAGGTG AAAAGGCAGA 900 

ACACTCCGCG CTTCTTAGAA GAGGAGTGAG AGGAAGGCGG GGGGCGCAGC AACGCATCGT 960 

GTGGCCCTCC CCTCCCACCT CCCTGTGTAT AAATATTTAC ATGTGATGTC TGGTCTGAAT 1020 

GCACAAGCTA AGAGAGCTTG CAAAAAAAAA AAGAAAAAAG AAAAAAAAAA ACCACGTTTC 1080 

C TGTGTCTTGA AGGCAAAAGA AAAAAAATTT CTACAGTAAA AAAAAAAAAA 1140 



1 11 21 31 41 51 

I I I I I I 

HAELEFVQII IIVWMMVMV WITCLLSHY KLSARSPISR HSQGRRREDA LSSEGCLWPS 
ESTVSGNGIP EPQVYAPPRP TDRLAVPPFA QRERPHRFQP TYPYLQHEID LPPTISLSDG 
EEPPPYQGPC TLQLRDPEQQ LELNRESVRA PPNRTIFDSD LMDSARLGGP CPPSSNSGIS 
ATCYGSGGRM EGPPPTYSEV IGHYPGSSFQ HQQSSGPPSL LEGTRLHHTH IAPLESAAIW 
SKEKDKQKGH PL 

SEQ ID N0:13 CJA5 DNA SEQUENCE 

Nucleic Acid Accession*: NM_012445 



WO 02/30268 



GCACGAGGGA AGAGGGTC3AT CCGACCCGGG GAAGGTCGCT GGGCAGGGCG AGTTGGGAAA 60 

GCGGCAGCCC CCGCCGCCCC CGCAGCCCCT TCTCCTCCTT TCTCCCACGT CCTATCTGCC 120 

TCTCGCTGGA GGCCAGGCCG TGCAGCATCG AAGACAGGAG GAACTGGAGC CTCATTGGCC 180 

GGCCCGGGGC GCCGGCCTCG GGCTTAAATA GGAGCTCCGG GCTCTGGCTG GGACCCGACC 240 

GCTGCCGGCC GCGCTCCCGC TGCTCCTGCC GGGTGATGGA AAACCCCAGC CCGGCCGCCG 300 

CCCTGGGCAA GGCCCTCTGC GCTCTCCTCC TGGCCACTCT CGGCGCCGCC GGCCAGCCTC 360 

TTGGGGGAGA GTCCATCTGT TCCGCCAGAG CCCCGGCCAA ATACAGCATC ACCTTCACGG 420 

GCAAGTGGAG CCAGACGGCC TTCCCCAAGC AGTACCCCCT GTTCCGCCCC CCTGCGCAGT 480 

GGTCTTCGCT GCTGGGGGCC GCGCATAGCT CCGACTACAG CATGTGGAGG AAGAACCAGT 540 

ACGTCAGTAA CGGGCTGCGC GACTTTGCGG AGCGCGGCGA GGCCTGGGCG CTGATGAAGG 600 

AGATCGAGGC GGCGGGGGAG GCGCTGCAGA GCGTGCACGC GGTGTTTTCG GCGCCCGCCG 660 

TCCCCAGCGG CACCGGGCAG ACGTCGGCGG AGCTGGAGGT GCAGCGCAGG CACTCGCTGG 720 

TCTCGTTTGT GGTGCGCATC GTGCCCAGCC CCGACTGGTT CGTGGGCGTG GACAGCCTGG 780 

ACCTGTGCGA CGGGGACCGT TGGCGGGAAC AGGCGGCGCT GGACCTGTAC CCCTACGACG 840 

CCGGGACGGA CAGCGGCTTC ACCTTCTCCT CCCCCAACTT CGCCACCATC CCGCAGGACA 900 

CGGTGACCGA GATAACGTCC TCCTCTCCCA GCCACCCGGC CAACTCCTTC TACTACCCGC 960 

GGCTGAAGGC CCTGCCTCCC ATCGCCAGGG TGACACTGGT GCGGCTGCGA CAGAGCCCCA 1020 

GGGCCTTCAT CCCTCCCGCC CCAGTCCTGC CCAGCAGGGA CAATGAGATT GTAGACAGCG 1080 

CCTCAGTTCC AGAAACGCCG CTGGACTCCG AGGTCTCCCT GTGGTCGTCC TGGGGACTGT 1140 

GCGGAGGCCA CTGTGGGAGG CTCGGGACCA AGAGCAGGAC TCGCTACGTC CGGGTCCAGC 1200 

CCGCCAACAA CGGGAGCCCC TGCCCCGAGC TCGAAGAAGA GGCTGAGTGC GTCCCTGATA 1260 

ACTGCGTC TA A GACCAGAGC CCCGCAGCCC CTGGGGCCCC CGGAGCCATG GGGTGTCGGG 1320 

GGCTCCTGTG CAGGCTCATG CTGCAGGCGG CCGAGGCACA GGGGGTTTCG CGCTGCTCCT 13 80 

GACCGCGGTG AGGCCGCGCC GACCATCTCT GCACTGAAGG GCCCTCTGGT GGCCGGCACG 1440 

GGCATTGGGA AACAGCCTCC TCCTTTCCCA ACCTTGCTTC TTAGGGGCCC CCGTGTCCCG 1500 

TCTGCTCTCA GCCTCCTCCT CCTGCAGGAT AAAGTCATCC CCAAGGCTCC AGCTACTCTA IS 60 

AATTATGGTC TCCTTATAAG TTATTGCTGC TCCAGGAGAT TGTCCTTCAT CGTCCAGGGG 1620 

CCTGGCTCCC ACGTGGTTGC AGATACCTCA GACCTGGTGC TCTAGGCTGT GCTGAGCCCA 1680 

CTCTCCCGAG GGCGCATCCA AGCGGGGGCC ACTTGAGAAG TCAATAAATG GGGCGGTTTC 1740 

GGAAGCGTCA GTGTTTCCAT GTTATGGATC TCTCTGCGTT TGAATAAAGA CTATCTCTGT 1800 
TGCTCAC 



SEQ ID NO:14 CJA5 Protein sequence: 
Prolein Accession*: NP_036577 

1 11 21 31 41 51 

I I ■ I I I I 

MENPSPAAAL GKALCALLLA TLGAAGQPLG GES1CSARAP AKYS1TFTGK WSQTAFPKQY 60 
PLFRPPAQWS SLLGAAHSSD YSMWRKNQYV SNGLRDFABR GEAWALMKEI EAAGEALQSV 120 
HAVFSAPAVP SGTGQTSAEL EVQRRHSLVS FWRIVPSPD WFVGVDSLDL CDGDRWREQA 180 
ALDLYPYDAG TDSGFTFSSP NFATIPQDTV TEITSSSPSH PANSFYYPRL KALPPIARVT 240 
LVRLRQSPRA FIPPAPVLPS RDNEIVDSAS VPETPLDCEV SLWSSWGLCG GHCGRLGTKS 300 
RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 



SEQ ID NO:15 LBH9 DNA SEQUENCE 

Nucleic Acid Accession*: NMJH2391 

Coding sequence: 26-457 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 3 60 

CAATCCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCG TGCACCCCCA AGACCAAAGC 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 

CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
TAATAT 



CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 
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SEQ ID HO:17 LEM9 DNASEQUENCE 

out: NM_005244 

1-1617 (underlined sequences correspond to start and slop codons) 



I I I I I 1 

ATGGTAGAAC TAGTGATCTC ACCCAGCCTC ACTGTAAACA GCGATTGTCT GGATAAACTG 60 

AAGTTTAACC GTGCTGACGC TGCTGTGTGG ACTCTGAGTG ACAGACAAGG CATCACCAAA 120 

TCGGCCCCCC TGAGAGTGOC CCAGCTCTTC TCCAGATCTT GCCCACGTGT CCTCCCCCGC 180 

CAGCCTTCCA CAGCCATGGC AGCCTACGGC CAGACGCAGT ACAGTGCGGG GATCCAGCAG 240 

GCTACCCCCT ATACAGCTTA CCCACCTCCA GCACAAGCCT ATGGAATCCC TTCCTACAGC 300 

ATCAAGACAG AAGACAGCTT GAACCATTCC CCTGGCCAGA GTGGATTCCT CAGCTATGGC 360 

TCCAGCTTCA GCACCTCACC CACTGGACAG AGCCCATACA CCTACCAGAT GCACGGCACA 420 

ACAGGGTTCT ATCAAGGAGG AAATGGACTG GGCAACGCAG CCGGTTTCGG GAGTGTGCAC 480 

CAGGACTATC CTTCCTACCC CGGCTTCCCC CAGAGCCAGT ACCCCCAGTA TTACGGCTCA 540 

TCCTACAACC CTCCCTACGT CCCGGCCAGC AGCATCTGCC CTTCGCCCCT CTCCACGTCC 600 

ACCTACGTCC TCCAGGAGGC ATCTCACAAC GTCCCCAACC AGAGTTCCGA GTCACTTGCT 660 

GGTGAATACA ACACACACAA TGGACCTTCC ACACCAGCGA AAGAGGGAGA CACAGACAGG 720 

CCGCACCGGG CCTCCGACGG GAAGCTCCGA GGCCGGTCTA AGAGGAGCAG TGACCCGTCC 780 

CCGGCAGGGG ACAATGAGAT TGAGCGTGTG TTCGTGTGGG ACTTGGATGA GACAATAATT 840 

ATTTTTCACT CCTTACTCAC GGGGACATTT GCATCCAGAT ACGGGAAGGA CACCACGACG 900 

TCCGTGCGCA TTGGCCTTAT GATGGAAGAG ATGATCTTCA ACCTTGCAGA TACACATCTG 960 

TTCTTCAATG ACCTGGAGGA TTGTGACCAG ATCCACGTK3 ATGACGTCTC ATCAGATGAC 1020 

AATGGCCAAG ATTTAAGCAC ATACAACTTC TCCGCTGACG GCTTCCACAG TTCGGCCCCA 1080 

GGAGCCAACC TGTGCCTGGG CTCTGGCGTG CACGGCGGCG TGGACTGGAT GAGGAAGCTG 1140 

GCCTTCCGCT ACCGGCGGGT GAAGGAGATG TACAATACCT ACAAGAACAA CGTTGGTGGG 1200 

TTGATAGGCA CTCCCAAAAG GGAGACCTGG CTACAGCTCC GAGCTGAGCT GGAAGCTCTC 12 60 

ACAGACC TCT GGCTGACCCA CTCCCTGAAG GCACTAAACC TCATCAACTC CCGGCCCAAC 1320 

TGTGTCAATG TGCTGGTCAC CACCACTCAA CTAATTCCTG CCCTGGCCAA AGTCCTGCTA 1380 

TATGGCCTGG GGTCTGTGTT TCCTATTGAG AACATCTACA GTGCAACCAA GACAGGGAAG 1440 

GAGAGCTGCT TCGAGAGGAT AATGCAGAGA TTCGGCAGAA AAGCTGTCTA CGTGGTGATC 1500 

GGTGATGGTG TGGAAGAGGA GCAAGGAGCG AAAAAGCACA ACATGCGTTT CTGGCGGATA 1560 
TCCIGCCACG CAGACCTGGA GGCACTGAGG CACGCCCTGG AACTGGAGTA TTTATAG 



1 11 21 31 41 51 

I I I I I I 

MVELVISPSL TVNEDCLDKL KFNRADAAVW TLSDRQGITK SAPLRVSQLF SRSCPRVLPR 60 

QPSTAMAAYG QTQYSAGIQQ ATEYTAYPPP AQAYGIPSYS IKTBDSLNHS PGQSGFLSYG 120 

SSFSTSPTGQ SPYTYQMHGT TGFYQGGNGL GNAAGFGSVH CDYPSYPGFP QSQYPQYYGS 180 

■ SYNPPYVPAS'SICPSPLSTS TYVLQEASHN -VPNQSSESLA GEYNTHNGPS TPAKEGDTDR 240 

PHRAEDGKLR GRSKRSSDPS PAGDNEIERV FVWDLDETII IFHSLLTGTF ASRYGKDTTT 300 

SVRIGIJIMEE MIFNLADTHL FFNDLEDCDQ IHVDDVSSDD NGQDLSTYNF SADGFHSSAP 360 

GANLCLGSGV HGGVDWMRKL AFRYRRVKEM YNTYKNWGG UGTPKRETW LQLRAELEAL 420 

TDLWLTHSLK ALNLINSRPN CVNVLVTTTQ LIPALAKVLL YGLGSVFPIE NIYSATKTGK 480 
ESCFERIMQR FGRKAVYWI GDGVEEEQGA KKHNMPFWRI SCHADLEALR HALELEYL 

SEQ ID N0:19 0AA1 DNA SEQUENCE 

Nucleic Acid Accession*: NM.002740 

Codinj sequence: 178-1968 (underlined sequences correspond to start and slop codons) 



1 11 21 31 41 51 

I 1 I I I I 

CCGCGGTTCC GGCTGCTCCG GCGAGGCGAC CCTTGGGTCG GCGCTGCGGG C~* 
AGGTAGGTGG GCGGACGGCC GCGGTTCTCC GGCAAGCGCA GGCGGCGGAG TCCCCCACGG 
CGCCCGAAGC GCCCCCCGCA CCCCCGGCCT CCAGCGTTGA GGCGGGGGAG TGAGGAGATG 
CCGACCCAGA GGGACAGCAG CACCATGTCC CACACGGTCG CAGGCGGCGG CAGCGGGGAC 
CATTCCCACC AGGTCCGGGT GAAAGCCTAC TACCGCGGGG ATATCATGAT AACACATTTT 
GAACCTTCCA TCTCCTTTGA GGGCCTTTGC AATGAGGTTC GAGACATGTG TTCTTTTGAC 
AACGAACAGC TCTTCACCAT GAAATGGATA GATGAGGAAG GAGACCCGTG TACAGTATCA 
TCTCAGTTGG AGTTAGAAGA AGCCTTTAGA CTTTATGAGC TAAACAAGGA TTCTGAACTC 
TTGATTCATG TGTTCCCTTG TGTACCAGAA CGTCCTGGGA TGCCTTGTCC AGGAGAAGAT 
AAATCCATCT ACCGTAGAGG TGCACGCCGC TGGAGAAAGC TTTATTGTGC CAATGGCCAC 
ACTTTCCAAG CCAAGCGTTT CAACAGGCGT GCTCACTGTG CCATCTGCAC A 
TGGGGACTTG GACGCCAAGG ATATAAGTGC ATCAACTGCA A 
TGGCATAAAC TCGTCACAAT TGAATGTGGG CGGCATTCTT TGCCACAGGA ACCAGTGATG 780 
CCCATGGATC AGTCATCCAT GCATTCTGAC CATGCACAGA CAGTAATTCC ATATAATCCT 840 
TCAAGTCATO AGAGTTTGGA TCAAGTTGGT GAAGAAAAAG AGGCAATGAA CACCAGGGAA 900 

TTTTG ATTTGCTCCG GGTAATAGGA 960 

\AAAA AAACAGATCG TATTTATGCA 1020 
1080 
1140 



TTTCAGACAG AAAGCAGATT GTTCTTTGTT ATAGAGTATG TAAATGGAGG AGACCTAATG 1200 
C AGCGACAAAG AAAACTTCCX GAAGAACATG CCAGATTTTA CTCTGCAGAA 1260 
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ATCAGTCTAG CATTAAATTA TCTTCATGAG CGAGGGATAA TTTATAGAGA TTTGAAACTG 1320 
GACAATGTAT TACTGGACTC TGAAGGCCAC ATTAAACTCA CTGACTACGG CATGTGTAAG 
GAAGGATTAC GGCCAGGAGA TACAACCAGC A 
CCTGAAATTT TAAGAGGAGA AGATTATGGT TTCAGTGTTG A 
CTCATGTTTG AGATGATGGC AGGAAGGTCT CCATTTGATA T 
CCTGACCAGA ACACAGAGGA TTATCTCTTC C 

CCACGTTCTC TGTCTGTAAA AGCTGCAAGT GTTCTGAAGA GTTTTCTTAA TAAGGACCCT 
AAGGAACGAT TGGGTTGTCA TCCTCAAACA GGATTTGCTG ATATTCAGGG ACACCCGTTC 
TTCCGAAATG TTGATTGGGA TATGATGGAG CAAAAACAGG TGGTACCTCC CTTTAAACCA 
AATATTTCTG GGGAATTTGG TTTGGACAAC TTTGATTCTC AGTTTACTAA TGAACCTGTC 
CAGCTCACTC CAGATGACGA TGACATTGTG AGGAAGATTG ATCAGTCTGA ATTTGAAGGT 
TTTGAGTATA TCAATCCTCT TTTGATGTCT GCAGAAGAAT GTGTCTGATC CTCATTTTTC 
AACCATGTAT TOTACTCATG TTGCCATTTA ATGCATGGAT AAACTTGCTG CAAGCCTGGA 2040 
TACAATTAAC CATTTTATAT TTGCCACCTA CAAAAAAACA CCCAATATCT TCTCTTGTAG 2100 
ACTATATGAA TCAATTATTA CATCTGTTTT ACTATGAAAA AAAAATTAAT ACTACTAGCT 2160 
TCCAGACAAT CATGTCAAAA TTTAGTTGAA C 
ATGAGTAATG AAGTTACCTT TTTTGTTTAA AAAAAAAAAA G 



I I I I I I 
HSHTVAGGGS GDHSHQVRVK AYYRGDIMIT HPEPSISFEG LCNEVRDMCS FDNEQLFTMK 
WIDEEGDPCT VSSQLELEEA FRLYELNKDS ELLIHVFPCV PERPGMPCPG EDKSrYRRGA 
T GHTFQAKRFN RRAHCAICTD RIWGLGRQGY KCINCKLLVH KKCHKLVTIE 



20 
25 



GLQDFDLLRV IGRGSYAKVL LVRLKKTDRI YAMKWKKEL VNDDEDIDWV QTEKHVFEQA 
30 SNHPFLVGLH SCFQTESRLF FVIEYVNGGD LMFHMQRQRK LPEEHARFYS AEISLALNYL 
HERGIIYRDL KLDNVLLDSE GHIKLTDYGM CKEGLRPGDT TSTFCGTPNY IAPEILRGED 
YGFSVDWWAL GVLMFEMMAG RSPFDIVGSS DNPDQNTEDY LFQVILEKQ1 R1PRSLSVKA 
ASVLKSFLNK DPKERLGCHP QTGFADIQGH PFFRNVDWDM MEQKQWPPF I 
^ DNFDSQFTNE PVQLTPDDDD IVRKIDQSEF EGFEYINPLL MSAEECV 

SEQIDN0:21 0BH2 DNA 

Nucla'c Acid Accession*: L05628 

Coding sequence: 1 97-4792 (underlined sequences correspond to start and stop radons) 




TGCCCGCCGC CGCCCGCGCC AGCAACCGGG CCCGATCACC C 
CGCCCGCGCC ACCGGCATGG CGCTCCGGGG CTTCTGCAGC GCCGATGGCT CCGACCCGCT 
CTGGGACTGG AATGTCACGT GGAATACCAG CAACCCCGAC TTCACCAAGT GCTTTCAGAA 
CACGGTCCTC GTGTGGGTGC CTTGTTTTTA CCTCTGGGCC TGTTTCCCCT TCTACTTCCT 
CTATCTCTCC CGACATGACC GAGGCTACAT TCAGATGACA CCTCTCAACA AAACCAAAAC 
TGCCTTGGGA TTTTTGCTGT GGATCGTCTG CTGGGCAGAC CTCTTCTACT CTTTCTGGGA 



840 

:cagcc 900 



CCTGGAGGGC AGTGACCTCT GGTCCTTAAA CAAGGAGGAC ACGTCGGAAC AAGTCGTGCC 

TGTTTTGGTA AAGAACTGGA AGAAGGAATG CGCCAAGACT AGGAAGCAGC CGGTGAAGGT 1020 

TGTGTACTCC TCCAAGGATC CTGCCCAGCC GAAAGAGAGT TCCAAGGTGG ATGCGAATGA 1080 

GGAGGTGGAG GCTTTGATCG TCAAGTCCCC ACAGAAGGAG TGGAACCCCT CTCTGTTTAA 1140 

GGTGTTATAC AAGACCTTTG GGCCCTACTT CCTCATGAGO TTCTTCTTCA AGGCCATCCA 1200 

CGACCTGATG ATGTTTTCCG GGCCGCAGAT CTTAAAGTTG CTCATCAAGT TCGTGAATGA 1260 

CACGAAGGCC CCAGACTGGC AGGGCTACTT CTACACCGTG CTGCTGTTTG TCACTGCCTG 1320 

CCTGCAGACC CTCGTGCTGC ACCAGTACTT CCACATCTGC TTCGTCAGTG GCATGAGGAT 1380 

CAAGACCGCT GTCATTGGGG CTGTCTATCG GAAGGCCCTG GTGATCACCA ATTCAGCCAG 1440 

AAAATCCTCC ACGGTCGGGG AGATTGTCAA CCTCATGTCT GTGGACGCTC AGAGGTTCAT 1500 

GGACTTGGCC ACGTACATTA ACATGATCTG GTCAGCCCCC CTGCAAGTCA TCCTTGCTCT 1S60 

CTACCTCCTG TGGCTGAATC TGGGCCCTTC CGTCCTGGCT GGAGTGGCGG TGATGGTCCT 1620 

CATGGTGCCC GTCAATGCTG TGATGGCGAT GAAGACCAAG ACGTATCAGG TGGCCCACAT 1680 

GAAGAGCAAA GACAATCGGA TCAAGCTGAT GAACGAAATT CTCAATGGGA TCAAAGTGCT 1740 

AAAGCTTTAT GCCTGGGAGC TGGCATTCAA GGACAAGGTG CTGGCCATCA GGCAGGAGGA 1800 

GCTGAAGGTG CTGAAGAAGT CTGCCTACCT GTCAGCCGTG GGCACCTTCA CCTGGGTCTG 1860 

CACGCCCTTT CTGGTGGCCT TGTGCACATT TGCCGTCTAC GTGACCATTC ACGAGAACAA 1920 

CATCCTGGAT GCCCAGACAG CCTTCGTGTC TTTGGCCTTG TTCAACATCC TCCGGTTTCC 1980 

CCTGAACATT CTCCCCATGG TCATCAGCAG CATCGTGCAG GCGAGTGTCT CCCTCAAACG 2040 

CCTGAGGATC TTTCTCTCCC ATGAGGAGCT GGAACCTGAC AGCATCGAGC GACGGCCTGT 2100 

CAAAGACGGC GGGGGCACGA ACAGCATCAC CGTGAGGAAT GCCACATTCA CCTGGGCCAG 2160 
GAGCGACCCT CCCACACTGA ATGGCATCAC CTTCTCCATC CCCGAAGGTG C 
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CGTGGTGGGC CAGGTGGGCT GCGGAAAGTC GTCCCTGCTC TCAGCCCTCT TGGCTGAGAT 22 BO 

GGACAAAGTG GAGGGGCACG TGGCTATCAA GGGCTCCGTG GCCTATGTGC CACAGCAGGC 2340 

CTGGATTCAG AATGATTCTC TCCGAGAAAA CATCCTTTTT GGATGTCAGC TGGAGGAACC 2400 

ATATTACAGG TCCGTGATAC AGGCCTGTGC CCTCCTCCCA GACCTGGAAA TCCTGCCCAG 2460 

5 TGGGGATCGG ACAGAGATTG GCGAGAAGGG CGTGAACCTG TCTGGGGGCC AGAAGCAGCG 2520 

CGTGAGCCTG GCCCGGGCCG TGTACTCCAA CGCTGACATT TACCTCTTCG ATGATCCCCT 2580 

CTCAGCAGTG GATGCCCATG TGGGAAAACA CATCTTTGAA AATGTGATTG GCCCCAAGGG 2640 

GATGCTGAAG AACAAGACGO GGATCTTGGT CACGCACAGC ATGAGCTACT TGCCGCAGGT 2700 

GGACGTCATC ATCGTCATGA GTGGCGGCAA GATCTCTGAG ATGGGCTCCT ACCAGGAGCT 2760 

1U GCTGGCTCGA GACGGCGCCT TCGCTGAGTT CCTGCGTACC TATGCCAGCA CAGAGOAGGA 2820 

GCAGGATGCA GAGGAGAACG GGGTCACGGG CGTCAGCGGT CCAGGGAAGG AAGCAAAGCA 2880 

AATGGAGAAT GGCATGCTGG TGACGGACAG TGCAGGGAAG CAACTGCAGA GACAGCTCAG 2940 

CAGCTCCTCC TCCTATAGTG GGGACATCAG CAGGCACCAC AACAGCACCG CAGAACTGCA 3000 

GAAAGCTGAG GCCAAGAAGG AGGAGACCTG GAAGCTGATG GAGGCTGACA AGGCGCAGAC 3060 

15 AGGGCAGGTC AAGCTTTCCG TGTACTGGGA CTACATGAAG GCCATCGGAC TCTTCATCTC 3120 

CTTCCICAGC ATCTTCCTTT TCATGTGTAA CCATGTGTCC GCGCTGGCTT CCAACTATTG 3180 

GCTCAGCCTC TGGACTGATG ACCCCATCGT CAACGGGACT CAGGAGCACA CGAAAGTCCG 3240 

GCTGAGCGTC TATGGAGCCC TGGGCATTTC ACAAGGGATC GCCGTGTTTG GCTACTCCAT 3300 

GGCCGTGTCC ATCGGGGGGA TCTTGGCTTC CCGCTGTCTG CACGTGGACC TGCTGCACAG 3360 

20 CATCCTGCGG TCACCCATGA GCTTCTTTGA GCGGACCCCC AGTGGGAACC TGGTGAACCG 3420 

CTTCTCCAAG GAGCTGGACA CAGTGGACTC CATGATCCCG GAGGTCATCA AGATGTTCAT 3480 

GGGCTCCCTC TTCAACGTCA TTGGTGCCTG CATCGTTATC CTGCTGGCCA CGCCCATCGC 3540 

CGCCATCATC ATCCCGCCCC TTGGCCTCAT CTACTTCTTC G1CCAGAGGT TCTACGTGGC 3600 

TTCCTCCCGG CAGCTGAAGC GCCTCGAGTC GGTCAGCCGC TCCCCGGTCT ATTCCCATTT 3660 

25 CAACGAGACC TTGCTGGGGG TCAGCGTCAT TCGAGCCTTC GAGGAGCAGG AGCGCTTCAT 3720 

CCACCAGAGT GACCTGAAGG TGGACGAGAA CCAGAAGGCC TATTACCCCA GCATCGTGGC 3780 

CAACAGGTGG CTGGCCGTGC GGCTGGAGTG TGTGGGOAAC TGCATCGTTC TGTTTGCTGC 3840 

CCTGTTTGCG GTGATCTCCA GGCACAGCCT CAGTGCTGGC TTGGTGGGCC TCTCAGTGTC 3900 

TTACTCATTG CAGGTCACCA CGTACTTGAA CTGGCTGGTT CGGATGTCAT CTGAAATGGA 3960 

30 AACCAACATC.GTGGCCGTGG AGAGGCTCAA GGAGTATTCA GAGACTGAGA AGGAGGCGCC 4020 

CTGGCAAATC CAGGAGACAG CTCCGCCCAG CAGCTGGCCC CAGGTGGGCC GAGTGGAATT 4080 

CCGGAACTAC TGCCTGCGCT ACCGAGAGGA CCTGGACTTC GTTCTCAGGC ACATCAATGT 4140 

CACGATCAAT GGGGGAGAAA AGGTCGGCAT CGTGGGGCGG ACGGGAGCTG GGAAGTCGTC 4200 

CCTGACCCTG GGCTTATTTC GGATCAACGA GTCTGCCGAA GGAGAGATCA TCATCGATGG 4260 

35 CATCAACATC GCCAAGATCG GCCTGCACGA CCTCCGCTTC AAGATCACCA TCATCCCCCA 4320 

GGACCCTGTT TTGTTTTCGG GTTCCCTCCG AATGAACCTG GACCCATTCA GCCAGTACTC 4380 

GGATGAAGAA GTCTGGACGT CCCTGGAGCT GGCCCACCTG AAGGACTTCG TGTCAGCCCT 4440 

TCCTGACAAG CTAGACCATG AATGTGCAGA AGGCGOGGAG AACCTCAGTG TCGGGCAGCG 4500 

CCAGCTTGTG TGCCTAGCCC GGGCCCTGCT GAGGAAGACG AAGATCCTTG TGTTGGATGA 4560 

40 GGCCACGGCA GCCGTGGACC TGGAAACGGA CGACCTCATC CAGTCCACCA TCCGGACACA 4620 

GTTCGAGGAC TGCACCGTCC TCACCATCGC CCACCGGCTC AACACCATCA TGGACTACAC 4680 

AAGGGTGATC GTCTTGGACA AAGGAGAAAT CCAGGAGTAC GGCGCCCCAT CGGACCTCCT 4740 

GCAGCAGAGA GGTCTTTTCT ACAGCATGGC CAAAGACGCC GGCTTGGTG T GAG CCCCAGA 4800 

_ GCTGGCATAT CTGGTCAGAA CTGCAGGGCC TATATGCCAG CGCCCAGGGA GGAGTCAGTA 4860 

45 CCCCTGGTAA ACCAAGCCTC - CCACACTGAA ACCAAAACAT AAAAACCAAA CCCAGACAAC 4920 

CAAAACATAT TCAAAGCAGC AGCCACCGCC ATCCGGTCCC CTGCCTGGAA CTGGCTGTGA 4980 
AGACCCAGGA GAGACAGAGA TGCGAACCAC C 



50 SEQ ID NO:22 OBH2 Protein sequence: 
Protein Accession*: AAB46616 

1 11 21 31 41 51 

I I I I I I 

J J MALRGFCSAD GSDPLWDWNV TWNTSNPDFT KCFQNTVLVW VPCFYLWACF PFYFLYLSRK 60 

DSGYIQMTFL NKTKTALGFL DWIUCWADLF YSFWERSRGI FLAPVFI»VSP TLLGITTLLA 120 

TFLIQLERRK GVQSSGIMLT FWLVAOTCAL AILRSKIMTA LKEDAQVDLF RDITFYVYFS 180 

MJ.IQLVLSC FSDRSPLFSE TIHDPNPCPE SSASFLSRIT FWWITGL1TO GYRQPLEGSD 240 

LWSLNKEDTS EQWFVLVKN WKKECAKTRK QPVKWYSSK DPAQPKESSK VDANEEVEAL 300 

60 IVKSPQKEWN PSLFKVLYKT FGPYFLMSFF FKAIHDLMMF SGPQILKLLI KFVNDTKAPD 360 

WQGYFYTVLL FVTACLQTLV U3QYFHICFV SGMRIKTAVI GAVYRKALVI TNSARKSSW 420 

GEIVNLHEVD AQRFMDLATY INMIWSASLQ VILALYLLWL NLGPSVLAGV AVMVtMVPVN 480 

AVMAMKTKTY QVAHMKSKDK RIKLMNEILN GIKVLKLYAW ELAFKDKVLA IRQEELKVLK 540 

KSAYLSAVGT FTWVCTPFLV ALCTFAVYVT IDENNILDAQ TAFVSLALFN 1LRFPLNILP 600 

65 MVISSIVQAS VSLKRLRIFL SHEELEPDSI ERRPVKDGGG TNSITVRNAT FTWARSDPPT 660 

LNGITFSIPE GALVAWGQV GCGKSSLLSA LLAEMDKVEG HVAIKGSVAY VPQQAWIQND 720 

SLRENILFGC QLEEPYYRSV IQACALLPDL EILPSGDRTE 1GEKGVNLSG GQKQRVSLAR 780 

AVYSNADIYL FDDPLSAVDA HVGKHIFEtTV 1GPKGMLKNK TRILVTHSMS YLPQVDVIIV 840 

MSGGKISEMG SYQELLARDG AFAEFLRTYA STEQEQDAEE NGVTGVSGPG KEAKQMENGH 900 

70 LVTDSAGKQL QRQLSSSSSY SGD1SRHHNS TAELQKAEAK KEETWKLMEA DKAQTGQVKL 960 

SVYWDYMKAI GLFISFLSIF LFMCNHVSAL ASNYWLSLWT DDPIWGTQE HTKVRLSVYG 1020 

ALGISQGIAV FGYSMAVSIG GILASRCLHV DLLHSILRSP MSFFERTPSG NLVNRFSKEL 1080 

DTVDSMIPEV IKMFHGSLFN VIGACIVILL ATPIAAIIIP PLGLIYFFVQ RFYVASSRQL 1140 

KRLESVSRSP VYSHFNETLL GVSVIRAFEE QERFIHQSDL KVDENQKAYY PSIVANRWLA 1200 

75 VRLECVGNCI VLFAALFAV I SRHSLSAGLV GDSVSYSLQV TTYLNWLVRM SSEMETNIVA 1260 

VERLKEYSET EKEAPWQIQE TAPPSSWPQV GRVEFBNYCL RYREDLDFVL RHINVTINGG 1320 

EKVGIVGRTC AGKSSLTLGL FR1NESAEGE IIIDGINIAK IGLHDLRFKI T1IPQDPVLF 1380 

SGSLRMNLDP FSQYSDEEVW TSLELAHLKD FVSALPDKLD HECAEGGENt, SVGQRQLVCL 1440 

ARALLRKTKI LVLDEATAAV DLETDDLIQS TIRTQFEDCT VLTIAHRLNT IMDYTRV1VL 1500 

80 DKGEIQEYGA PSDLLQQRGL FYSMAKDAGL V 
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SEQ ID N0:23 PAA2 DNA SEQUENCE 

Nucleic Acid Accession »: NM.013309 

Coding sequence: 1-1290 (underlined sequences correspond lo start and slop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGGCCGGCT CTGGCGCGTG GAAGCGCCTC AAATCTATGC TAAGGAAGGA TGATGCGGCG 60 

CTGTTTTTAA ATGACACCAG CGCCTTTGAC TTCTCGGATG AGGCGGGGGA CGAGGGGCTT 120 

TCTCGGTTCA ACAAACTTCG AGTTGTGGTG GCCGATGACG GTTCCGAAGC CCCGGAAAGG 180 

CCTGTTAACG GGGCGCACCC GACCCTCCAG GCCGACSATG ATTCCTTACT GGACCAAGAC 240 

TTACCTTTGA CCAACAGTCA GCTGAGTTTG AAGGTGGACT CCTGIGACAA CTGCAGCAAA 300 

CAGAGAGAGA TACTGAAGCA GAGAAAGGTG AAAGCCAGGT TGACCATTGC TGCCGTTCTG 360 

TACTTGCTTT TCATGATTGG AGAACTTGTA GGTGGATACA TTGCAAATAG CCTAGCAATC 420 

ATGACAGATG CACTTCATAT GTTAACTGAC CTAAGCGCCA TCATACTCAC CCTGCTTGCT 480 

TTGTGGCTAT CATCAAAATC ACCAACCAAA AGATTCACCT TTGGATTTCA TCGCTTAGAG 540 

GTTTTGTCAG CTATGATTAG TGTGCTGTTG GTGTATATAC TTATGGGATT CCTCTTATAT 600 

GAAGCTGTGC AAAGAACTAT CCATATGAAC TATGAAATAA ATGGAGATAT AATGCTCATC 660 

ACCGCAGCTG TTGGAGTTGC AGTTAATGTA ATAATGGGGT TTCTGTTGAA CCAGTCTGGT 720 

CACCGTCACT CCCATTCCCA CTCCCTCCCT TCAAATTCCC CTACCAGAGG TTCTGGGTGT 780 

GAACGTAACC ATGGGCAGGA TAGCCTGGCA GTGAGAGCTG CATTTGTACA TGCTTTGGGA 840 

GATTTGGTAC AGAGTGTTGG TGTGCTAATA GCTGCATACA TCATACGATT CAAGCCAGAA 900 

TACAAGATTG CTGATCCCAT CTGTACATAC GTATTTTCAT TACTTGTGGC TTTTACAACA 960 

TTTCGAATCA TATGGGATAC AGTAGTTATA ATACTAGAAG GTGTGCCAAG CCATTTGAAT 1020 

GTAGACTATA TCAAAGAAGC CTTGATGAAA ATAGAAGATG TATATTCAGT CGAAGATTTA 1080 

AATATCTGGT CTCTCACTTC AGGAAAATCT ACTGCCATAG TTCACATACA GCTAATTCCT 1140 

GGAAGTTCAT CTAAATGGGA GGAAGTACAG TCCAAAGCAA ACCATTTATT ATTGAACACA 1200 

TTTGGCATGT ATAGATGTAC TATTCAGCTT CAGAGTTACA GGCAAGAAGT GGACAGAACT 1260 
TGTGCAAATT GTCAGAGTTC TAGTCCCTGA 



1 11 21 31 41 51 

I ] I I I I 

MAGSGAWKRL KSMLRKDDAP LFLNDTSAFD FSDEAGDEGL SRFNKLRWV AT"" 
PVNGAHPTLQ AUDDSLLDQD LPLTNSQLSL KVDSCDNCSK QREH-KQRKV KARLTIAAVL 
YLLFMIGELV GGYIANSLAI MTDALHMLTD LSA1ILTLLA LWLSSKSPTK RFTFGFHRLE 
VLSAMISVLL VYILMGFLLY EAVQRTIHMN YEINGDIMLI TAAVGVAVNV IKGFLLNQSG 
HRHSHSHSLP SNSPTRGSGC ERNHGQDSLA VRAAFVHALG DLVQSVGVLI A 
YKIADPICTY VFSLLVAFTT FRIIWDTWI ILEGVPSHLN VDYIKE/ 
NIWSLTSGKS TAIVHIQLIP GSSSKWEEVQ SKANHLLLNT FGMYRCTIQL QSYRQEVDRT 420 
CANCQSSSP 

SEQ ID N0:25 PAA3 DNA SEQUENCE 

Nucleic Acid Accession #: AB037765 

Codiny sequence: 375-2798 (underiined sequences correspond to start andstop codons) 



CGCGTGGCGG CGGGAACTGT TGGCCGCGCG GCCTCGGGAA CGGCCCAGGT CCCCGCCCGC 180 

AGGTCCCGGG CAGATAACAT AGATCATCAG TAGAAAACTT CTTGAAGTTG TTCAAGAAAA 240 

ATTTGAAAGT AGCAAAATAG AAAATAAAGA ATTAACAGCA GATACAGAGG ACAGCATGGA 300 

AGTGTTGTCT TAGGAAACAG AACACAGCAG TGAAAAAACA GACAAAATCC GCTCAGATAC 360 

AACTGCAGCT GATAATGTTT TCCGGCTTCA ATGTCTTTAG AGTTGGGATC TCTTTTGTCA 420 

TAATGTGCAT TTTTTACATG CCAACAGTAA ACTCTTTACC AGAACTGAGT CCTCAGAAAT 480 
ATTTTAGTAC ATTGCAACCA GGTCTTGAAG AACTGAATGA GGCTGTTAGA 0 
ACTATGGAAT TTCAGTTGCC AAGGTTAATT GTGTCAAAGA AGAAAW 

GAAAAGAAAA GGATTTGATG AAAGCATATT TATTCA&GGG CAACATATTG CTCAGAGAAT 660 

TCCCTACTGA CACCTTGTTT GATGTGAATG CCATTGTCGC CCATGTTCTC TTTGCTCTTC 720 

TTTTTAGTGA AGTGAAATAT ATTACCAACC TGGAAGACCT TCAGAACATA GAAAATGCTC 780 

TGAAAGGAAA AGCAAATATT ATATTCTCAT ATGTAAGAGC CATTGGAATA CCAGAGCACA 840 

GAGCAGTCAT GGAAGCCGGT TTTGTGTATG GGACTACATA CCAATTTGTC TTAACCACAG 900 

AAATTGCCCT TTTGGAAAGT ATTGGCTCTG AGGATGTGGA ATATGCACAT CTCTACTTTT 960 

TTCATTGTAA ACTAGTCTTG GACTTGACCC AGCAATGTAG AAGAACACTA ATGGAACAGC 1020 

CATTGACTAC ACTGAACATT CACCTGTTTA TTAAGACAAT GAAAGCACCT CTGTTGACTG 1080 

AAGTTGCTGA AGATCCTCAA CAAGTTTCAA CTGTCCATCT CCAACTGGGC TTACCACTGG 1140 

TTTTTATTGT TAGCCAACAG GCTACTTATG AAGCTGATAG AAGAACTGCA GAATGGGTTG 1200 

CTTGGCGTCT TCTGGGAAAA GCAGGAGTTC TACTCTTGTT AAGGGACTCT TTGGAAGTGA 1260 

ACATTCCTCA AGATGCTAAT GTGGTCTTCA AAAGAGCAGA AGAGGGAGTT CCAGTGGAAT 1320 

TTTTGGTATT ACATGATGTT GATTTAATAA TATCTCATGT GGAAAATAAT ATGCACATTG 1380 

AGGAAATACA AGAAGATGAA GACAATGACA TGGAAGGTCC AGATATAGAT GTTCAGGATG 1440 

ATGAAGTGGC AGAAACTGTT TTCAGAGATA GGAAGAGAAA ATTACCTTTC GAACTTACAG 1500 
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TGGAACTAAC AGAAGAAACA TTTAATGCAA CAGTGATGGC TTCTGACAGC ATAGTACTCT 1560 

S TTGGCAAGCA GTATCCATGG CATTTTTGCA ATCCTATATT GATGTGGCAG 1620 

h AGGCACATCT ACTATGGTTC TTACTAGAAT AAACIGTGCA GATTGGTCTG 1680 

ATGTATGTAC TAAGCAAAAT GTTACTGAAT TTCCTATCAT AAAGATGTAC AAGAAAGGCG 1740 

AGAACCCAGT ATCTTATG CT GGAATGTTAG GAACCAAAGA TCTCCTAAAA TTTATCCAGC 1800 

T TTCATATCCA GTGAATATAA CATC GATCC A AGAAGCAGAA GAATATTTAA 1860 

AGGAAACTAC 



G ACACTGGTAA 22 80 

AGCAGAAATA CTTGGATTCA TTTACTCCAT GCTGGTTAAA TCTAAAGAAT ACTCCAGTGG 2340 

GGAGAGGAAT CTTGCGGGCA TATTTTGATC CTCTGCCTCC CCTTCCTCTT CTTGTTTTGG 2400 

TGAATCTGCA TTCAGGTGGC CAAGTATTTG CATTTCCTTC AGACCAGGCT ATAATTGAAG 2460 

AAAACCTTGT ATTGTGGCTG AAGAAATTAG AAGCAGGACT AGAAAATCAT ATCACAATTT 2520 

TACCTGCTCA AGAATGGAAA CCTCCTCTTC CAGCTTATGA TTTTCTAAGT ATGATAGATG 2580 

CCGCAACATC TCAACGTGGC ACTAGGAAAG TTCCCAAGTG TATGAAAGAA ACAGATGTGC 2640 

AGGAGAATGA TAAGGAACAA CATGAAGATA AATCGGCAGT CAGAAAAGAA CCGATTGAAA 2700 

CTCTGAGAAT AAAGCATTGG AATAGAAGTA ATTGGTTTAA AGAAGCAGAA AAATCATTTA 2760 

GACGTGATAA AGAGTTAGGA TGCTCAAAAG TGAACTAATT TTATAGGGCT GTGGTTTCCA 2820 

AAATTTTTTT GGCATGATAG ACTTAATTTA TTTCCTTAAA GAATAATATT AAATCATTTC 2880 

AAGTTTGCAG ACTAGTGCCA TCCAATAGAA TTATAATATA AGTCACATAT TTTATTTAAA 2940 

ATTTTCTAGT AACTACATTA AACAAAGTAA AAGTGAGCAG GGCAAAATAA TTTTGATATT 3000 

ACTTTTCACC CAGTAGTATA CCCAAAATAG CGAAATATAG AAATTATTAA TGAGATATTT 3060 

TACATCCTTT TTTGTACCAA GTCTTCTAAA TGCAGTACAT ATTTTATACT TACTGCATTT 3120 

CTTACTTCCG AGTAGCCATA TTTCAAGTGT TCATTGCCAC ATGTGGCCTG TGACTACTGT 3180 

ATTGGACAGT TCAGTACTAG ACAAAAACTA GCATAATTAA CTTAGTTCTA GCCATGATTT 3240 

CTATTTGGAT TAAAATTAAA CTCTAATCAC AGTTAACTCC ACAGTGCATT CATGCAGCTG 3300 

ACAGTTATAT TTGTTOTATT GGAGTCATGA TATTAAAA1C AGCGTTTGTC AACCTCAGGG 3360 

GATATTTAGC AATTGTCGGG AGACATTTTT GATGTCATGA CTAGGGCAGT TATTGACATT 3420 

TAGTGAGTAG AGGCCATGGA TCCTGCTAAA TAACCTGCAT TGGACAGCGC CCCACAACAA 3480 

AGAATTATCC TGCCCGAAAT GGTAGTCGTG CCAAGGCTGA GTAACCTTGT GTTAAAAGTA 3540 

ACCTGTGGCA GACTAGGTTT CCAGAATTTC CTGGTTCTGC TCACGTATCA TGTTTGAAAA 3600 

AATTTTGGCT ATTAAAGATA TGTATTAGAT GGTCTTATCC TGATTATTAC CTGGATACAA 3660 

T TCAGAAAGTG ATGGGATAAC CCTAGAAGAG GACTCAGAAT 3720 

A GTCTTAAAAC CTCCTCTTAT TTCTACAAGT TATATGGCTA 3780 

3 AGAGGCTCCC 3840 

TCATCTGAAG CGTCTCTGAA ATCTACCCTT GCAAGCTTCA GACAAATCAG TTGATCTCCC 3900 
TGAGCCACAC GGCCTCATTC TGTGAGGGAG GGAAAGATTA GCCAAAGAGT T 
TCCAAATCAC TTAGCTGTTA GACTGATCTG TTTGTAGCAG 
CTGTGCATTT TTTGAGACAT TTGTTGAGAA TATTCTATTT 
TTTTTAATAT CTACTTGATA TCTTGTTCTT TAAATTTTCT TCACATATGG T 

ACAACTGATT TTTATAACTG AAATTTAAGG AATCTAACAG CTAAAACTCA GTAAGTGCAT 4200 

MTATTTCCTT ATAACATAGA CCCGTTGCTA CTCTCAGCAC CCTCTCCTCA ATTTTTTTTC 4260 



4320 



CAATGAGAGT GAACTCTAAA TATAGGTTGT AGTAATAAAA CATCATTAGC CTAATTATTA 4380 



SEQ ID NO:2B PAA3 Protein sequence: 



I I I I I I 

MFSGFNVFRV GISFVIMCTF YMPTVNSLPE LSPQKYFSTL QPGLEELHEA VRPLQDYGIS 
VAKVNCVKEE ISRYCGKEKD LMKAYLPKGN ILLREFPTDT LFDVKAIVAH VLFALLFSEV 
KYITNLEDLQ NIENALKGKA NIIFSYVRAI G1PEHRAVME AGFVYGTTYQ FVLTTEIALL 
ESIGSEDVEY AHLYFFHCKL VLDLTQQCRR TLMEQPLTTL HIHLFIKMK APLLTEVAED 
PQQVSTVHLQ .LGLPLVFIVS QQATYEADRR TAEWVAWRLL GKAGVLLLLR DSLEVKIEQD 
ANWFKRAEE GVPVEFLVLH DVDLIISHVE NHHHIEEIQE DEDNDMEGPD IDVQDDEVAE 
TVFRDRKRKL PLELTVELTE ETFNATVMAS DSIVLFYAGW QAVSMAFLQS YIDVAVKLKG 
TSTMLLTRIN CADWSDVCTK QNVTEFPIIK MYKKGENPVS YAGMLGTKDL LKFIQLNRIS 
YPVKITSIQE AEEYLSGELY KDLILYSSVS VLGLFEPTMK TAKEDFSEAG NYLKGYVITG 
IYSEEDVLLL STKYAASLPA LLLARHTEGK IESIPLASTH AQD1VQIITD ALLEMFPEIT 
VENLPSYFRL QKPLLILFSD GTVNPQYKKA ILTLVKQKYL DSFTPCWLNL KNTPVGRGIL 
RAYFDPLPPL PLLVLVNLHS GGQVFAFPSD QAIIEENLVL WLKKLEAGLE NHITILPAQE 
WKPPLPAYDF LSMIDAATSQ RGTRKVPKCM KETDVQENDK EQHEDKSAVR KEPIETLRIK 
HWNRSNWFKE AEKSFRRDKE LGCSKVN 

SEQ ID N0:27 PAA5 DNA SEQUENCE 

Nucleic Acid Accession*: nm.012449 

Coding sequence: 66-1085 (underlined sequences correspond to start and stop codons) 



CCGAGACTCA CGGTCAAGCT A 
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GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 
AAAGACCTGT GCTTTTGCAT TTGCACCAAA CAGCCCATGC TGATGAATTT GACTGCCCTT 
CAGAACTTCA GCACACACAG GAACTCTTTC CACAGTGGCA CTTGCCAATT AAAATAGC TG 
CTATTATAGC ATCTCTGACT TTTCTTTACA CTCTTCTGAG GGAAGTAATT CACCCTTTAG 
CAACTTCCCA TCAACAATAT TTTTATAAAA TTCCAATCCT GGTCATCAAC A 
CMTGGTTTC CATCACTCTC TTGGCATT5G TTTACCTGCC AGGTGTC 
TCCAAC1TCA TAATGGAACC AAGTATAAGA AGTTTCCACA TTGGTTGGAT A 
TAACAAGAAA GCAGTTTGGG CTTCTCAGTT TCTTTTTTGC TGTACTGCAT GCAATTTATA 600 
GTCTGTCTTA CCCAATGAGG CGATCCTACA GATACAAGTT GCTAAACTGG GCATATCAAC 660 
AGGTCCAACA AAATAAAGAA GATGCCTGGA TTGAGCATGA TGTTTGGAGA ATGGAGATTT 720 
ATGTGTCTCT GGGAATTGTG GGATTGGCAA TACTGGCTCT GTTGGCTGTG ACATCTATTC 780 
CATCTGTGAG TGACTCTTTG ACATGGAGAG AATTTCACTA TATTCAGAGC AAGCTAGGAA 840 
TTGTTTCCCT TCTACTGGGC ACAATACACG CATTGATTTT TGCCTGGAAT AAGTGGATAG 900 
ATATAAAACA ATTTGTATGG TATACACCTC CAACTTTTAT GATAGCTGTT TTCCTTCCAA 960 
TTGTTG1CCT GATATTTAAA AGCATACTAT TCCTGCCATG CTTGAGGAAG AAGATACTGA 1020 
AGATTAGACA TGGTTGGGAA GACGTCACCA AAATTAACAA AACTGAGATA TGTTCCCAGT 1080 
TGTAGAATTA CTGTTTACAC ACATTTTTGT TCAATATTGA TATATTTTAT CACCAACATT 1140 
TCAAGTTTGT ATTTGTTAAT AAAATGATTA TTCAAGGAAA AAAAAAAAAA AAAAA 



1 11 21 31 41 51 

I I I I I I 

MBSRKDITNQ EELVIKMKPRR NLEEDDYLHK DTGETSMLKR FVLLHLHQTA HADEFDCPSE 
LQHTQELFPQ WHLPIKIAAI IASLTFLYTL LREVIHPLAT SHQQYFYKIP ILVINKVLPM 
VSITLLALVY LPGVIAAIVQ LHNGTKYKKF PHWLDKWMLT RKQFGLLSFF FAVLHAIYSL 
SYPMEHSYRY KLLNWAYQQV QQNKEDAWIE HDVWRMEIYV SLGIVGLAIL ALLAVTSIPS 
VSDSLTWREF HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQFVWYTPPT FMIAVFLPIV 
VLIFKSILFL PCLRKKILKI RHGWEDVTKI NKTEICSQL 

SEQ ID NO:29 PAA7 DNA SEQUENCE 

Nucleic Acid Accession*: NM_030774 

Coding sequence: 1-963 (underlined sequences correspond to start and stop codons) 



AAAGCCCATT TCTGGGTTGG CTTCCCCCTC CTTTCCATGT ATGTAGTGGC AATGTTTGGA 
AACTGCATCG TGGTCTTCAT CGTAAGGACG GAACGCAGCC TGCACGCTCC GATGTACCTC 
TTTCTCTGCA TGCTTGCAGC CATTGACCTG GCCTTATCCA CATCCACCAT GCCTAAGATC 
CTTGCCCTTT TCTGGTTTGA TTCCCGAGAG ATTAGCTTTG AGGCCTGTCT TACCCAGATG 
TTATTC ATGCCCTCTC A 



480 

i TTGTGTCCAC 540 



CAGGATGTAA TGAAGTTGGC CTATGCAGAC ACTTTGCCCA ATGTGGTATA TGGTCTTACT 

GCCATTCTGC TGGTCATCGG CGTGGACGTA ATGTTCATCT CCTTGTCCTA TTTTCTGATA 660 

ATACGAACGG TTCTGCAACT GCCTTCCAAG TCAGAGCGGG CCAAGGCCTT TGGAACCTGT 720 

GTGTCACACA TTGGTGTGGT ACTCGCCTTC TATGTGCCAC TTATTGGCCT CTCAGTGGTA 780 

CACCGCTTTG GAAACAGCCT TCATCCCATT GTGCGTGTTG TCATGGGTGA CATCTACCTG 840 

CTGCTGCCTC CTGTCATCAA TCCCATCATC TATGGTGCCA AAACCAAACA GATCAGAACA 900 

CGGGTGCTGG CTATGTTCAA GATCAGCTGT GACAAGGACT TGCAGGCTGT GGGAGGCAAG 960 

TGACCCTTAA CACTACACTT CTCCTTATCT TTATTGGCTT GATAAACATA ATTATTTCTA 1020 

ACACTAGCTT ATTTCCAGTT GCCCATAAGC ACATCAGTAC TTTTCTCTGG CTGGAATAGT 1080 

AGTATTACAT GATTTAAAGA CTACAATAAA ACCAAACATG CTTATAACAT TAAGAAAAAC 1200 

AATAAAGATA CATGATTGAA ACCAAGTTGA AAAATAGCAT ATGCCTTGGA GGAAATGTGC 1260 

TCAAATTACT AATGATTTAG TGTTGTCCCT ACTTTCTCTC TCTTTTTTCT TTCTTTTTTT 1320 

TTTATTATGG TTAGCTGTCA CATACAACTT TTTTTTTTTT TGAGATGGGG TCTCGCTCTG 1380 

TCACCAGGCT GGAGTGCAGT GGCGCGATCT CGGCTCACTG CAACCTCCAC ATCCCATGTT 1440 

GAAGTAATTC TTCTGCCTCA GCCTCCCGAG TAGCTGGGAC TAGAGGAACG TGCCACCATG 1500 

ACTGGCTAAT TTTCTGTATT TTTTAGTAGA GACAGAGTTT CACCATGTTG GCCAGGATGG 1560 

TCTCGATCTC CTGACCTTGT GATCCACCCG CCTCAGCCTC CCAAAGTGTT GGGATTACAG 1620 

GTGTGAACCA CTGTGCCCGG CCTGTGTACA ACTTTTTAAA TAGGGAATAT GATAGCTTCG 1680 

G TGCACCTATA GCCCCCACTG CCTGGAAAGC TGAGGTGGGA GAATCGCTTG 1740 

G TTTGAGGTTA CAGTGATCCA CGATCGTACC ACTACACTCC AGCCTGGGCA 1800 

;TGTCTC AAAGCATAAA ATGGAATAAC ATATCAAATG AAACAGGGAA 1860 

AATGAAGCTG ACAATTTATG GAAGCCAGGG CTTGTCACAG 'TCTCTACTGT TATTATGCAT 1920 

TACCTGGGAA TTTATATAAG CCCTTAATAA TARTGCCAAT GAACATCTCA TGTGTGCTCA 1980 

CAATGTTCTG GCACTATTAT AAGTGCTTCA CAGGTTTTAT GTGTTCTTCG TAACTTTATG 2040 

GAGTAGGTAC CATTTGTGTC TCTTTATTAT AAGTGAGAGA AATGAAGTTT ATATTATCAA 2100 

GGGGACTAAA GTCACACGGC TTGTGGGCAC TGTGCCAAGA TTTAAAATTA AATTTGATGG 2160 

TTGAATACAG TTACTTAATG ACCATGTTAT ATTGCTTCCT GTGTAACATC TGCCATTTAT 2220 

TTCCTCAGCT GTACAAATCC TCTGTTTTCT CTCTGTTACA CACTAACATC AATGGCTTTG 2280 

TACTTGTGAT GAGAGATAAC CTTGCCCTAG TTGTGGGCAA CACATGCAGA ATAATCCTGT 2340 

TTTACAGCTG CCTTTCGTGA TCTTATTGCT TGCTTTTTTC CAGATTCAGG GAGAATGTTG 2400 

TTGTCTATTT GTCTCTTACA TCTCCTTGAT CATGTCTTCA TTTTTTAATG TGCTCTGTAC 2460 

CTGTCAAAAA TTTTGAATGT ACACCACATG CTATTGTCTG AACTTGAGTA TAAGATAAAA 2520 
TAAAATTTTA TTTTAAATTT T 
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5 
10 
15 



SEQ ID NO:30 PAA7 PROTEIN SEQUENCE 

Protein Accession*: NP_1 10401 

1 11 21 31 41 51 

I I I I I I 

MSSCNFTHAT FVLIGIPGLE KAHFWVGFPL LSMYWAMFG NCIWFT 

PLCMLAAIDL ALSTSTMPKI LALFWFDSRE ISFEACLTQM FFIHAI/SAIE S' 
RWA1CHPLR HAAVLNNTVT AQIGIVAWR GSLFFFPLPL LIKRLAI 
QDVMKLAYAD TLFNWYGLT AILLVMGVDV MFISLSYFLI IRTVLQLPSK S 
VSHIGWLAF YVPLIGLSW HRFGNSLHPI VRWMGDIYL LLPFVINPI1 YGAKTKQ1RT 
RVLAMFKISC DKDLQAVGGK 

SEQ ID N0:31 PAV6 DNA SEQUENCE 
ndstopcodons) 

1 11 21 31 41 51 

I I I I I I 

ATGAACTGGG AGCTGCTGCT GTGGCTGCTG GTGCTGTGCG CGCTGCTCCT GCTCTTGGTG 
CAGCTGGTGC GCTTCCTGAG GGCTGACGGC GACCTGACGC TACTATGGGC CGAGTGGCAG 
GGACGACGCC CAGAATGGGA GCTGACTGAT ATGGTGGTGT GGGTGACTGG AGCCTCGAGT 
GGAATTGGTG AGGAGCTGGC TTACCAGTTG TCTAAACTAG GAGTTTCTCT TGTGCTGTCA 
GCCAGAAGAG TGCATGAGCT GGAAAGGGTG AAAAGAAGAT GCCTAGAGAA TGGCAATTTA 
3 ACACTGGTTC CCATGAAGCG 



GCTACCAAAG CTGTTCTCCA GGAGTTTGGT AGAATCGACA TTCTGGTCAA CAATGGTCGA 420 

ATGTCCCAGC GTTCTCTGTG CATGGATACC AGCTTGGATG TCTACAGAAA GCTAATAGAG 480 

CTTAACTACT TAGGGACGGT GTCCTTGACA AAATGTGTTC TGCCTCACAT GATCGAGAGG S40 

AAGCAAGGAA AGATTGTTAC TGTGAATAGC ATCCTGGGTA TCATATCTGT ACCTCTTTCC 600 

ATTGGATACT GTGCTAGCAA GCATGCTCTC CGGGGTTTTT TTAATGGCCT TCGAACAGAA 660 



ATTGTGGAGA ATTCCCTAGC TGGAGAAGTC ACAAAGACTA TAGGCAATAA TGGAGACCAG 780 
TCCCACAAGA TGACAACCAG TCGTTGTGTG CGGCTGATGT TAATCAGCAT GGCCAATGAT 840 
TTGAAAGAAG TTTGGATCTC AGAACAACCT TTC TTGTTAG TAACATATTT GTGGCAATAC 900 



AAGAGTGGTG TGGATGCAGA CTCTTCTTAT TTTAAAATCT TTAAGACAAA ACATGACTGA 

SEQ ID H0:32 PAV6 Protein sequence 



MFTWAWWITN KMGKKRENF KSGVDADSSY FKIFKTKHD 

SEQ ID N0:33 PBA6 DNA SEQUENCE 

Nucleic Acid Accession*: NM_0O6853 



AGGAATCTGC G 

CATGAGGATT CTGCAGTTAA TCCTCCTTGC TCTGGCAACA GGGCTTGTAG GGGGAGAGAC 180 

CAGGATCATC AAGGGGTTCG AGTCCAAGCC TCACTCCCAG CCCTGGCAGG CAGCCCTGTT 240 

CGAGAAGACG CGGCTACTCT GTOGGGCGAC GCTCATCGCC CCCAGATGGC TCCTGACAGC 300 

AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360 

GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACCCCG GCTTCAACAA 420 

CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATCGCCAGT 480 

CTCCATCACC TGGGCK3TGC GACCCCTCAC CCTCTCCTCA CGCTGTGTCA CTGCTGGCAC 540 

CAGCTGCCTC ATTTCCGGCT GGGGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACAC 600 

CTTGCGATGC GCCAACATCA CCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660 

& GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 72 0 

K CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780 

^ CCCGAAAGCC TGGTGTCTAC ACGAAAGTCT GCAAATATGT 840 

GGACTGGATC CAGGAGACGA TGAAGAACAA CTAGACTGGA CCCACCCACC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAGA AACCCTAAGC 960 

CAAGACCCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020 

AATCAACCTG GGGTTCGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCCCCAGCCC CAAAGACAGC 1140 
TCCTGGCCAT ATATCAAGGT TTCAATAAAT ATTTGCTAAA TGAGTG 

SEP ID NO:34 PBA6 PROTEIN SEQUENCE 

Protein Accession*: NP.006844 
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MRILQLIMA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT h 
AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPHKDHRND IMLVKMASPV 
SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 
NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 



10 



SEQ ID NO:35 PBC1 DNA SEQUENCE 

Nucleic Acid Accession*: NMJXH775 

Coding sequence: 70-972 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTAAAGCTCT CTTGCTGCCT AGCCTCCTGC CGGCCTCATC TTCGCCCAGC CAACCCCGCC 60 

TGGAGCCC TA TG GCCAACTG CGAGTTCAGC CCGGTGTCCG GGGACAAACC CTGCTGCCGG 120 

CTCTCTAGGA GAGCCCAACT CTGTCTTGGC GTCAGTATCC TGGTCCTGAT CCTCGTCGTG 180 

GTGCTCGCGG TGGTCGTCCC GAGGTGGCGC CAGACGTGGA GCGGTCCGGG CACCACCAAG 240 

CGCTTTCCCG AGACCGTCCT GGCGCGATGC GTCAAGTACA CTGAAATTCA TCCTGAGATG 300 

AGACATGTAG ACTGCCAAAG TGTATGGGAT GCTTTCAAGG GTGCATTTAT TTCAAAACAT 360 

CCTTGCAACA TTACTGAAGA AGACTATCAG CCACTAATGA AGTTGGGAAC TCAGACCGTA 420 

CCTTGCAACA AGATTCTTCT TTGGAGCAGA ATAAAAGATC TGGCCCATCA GTTCACACAG 480 

GTCCAGCGGG ACATGTTCAC CCTGGAGGAC ACGCTGCTAG GCTACCTTGC TGATGACCTC 540 

ACATGGTGTG GTGAATTCAA CACTTCCAAA ATAAACTATC AATCTTGCCC AGACTGGAGA 600 

AAGGACTGCA GCAACAACCC TGTTTCAGTA TTCTGGAAAA CGGTTTCCCG CAGGTTTGCA 660 

GAAGCTGCCT GTGATGTGGT CCATGTGATG CTCAATGGAT CCCGCAGTAA AATCTTTGAC 720 

AAAAACAGCA CTTTTGGGAG TGTGGAAGTC CATAATTO3C AACCAGAGAA GGTTCAGACA 780 

CTAGAGGCCT GGGTGATACA TGGTGGAAGA GAAGATTCCA GAGACTTATG CCAGGATCCC 840 

ACCATAAAAG AGCTGGAATC GATTATAAGC AAAAGGAATA TTCAATTTTC CTGCAAGAAT 900 

ATCTACAGAC CTGACAAGTT TCTTCAGTGT GTGAAAAATC CTGAGGATTC ATCTTGCACA 960 

TCTGAGATC T GAG CCAGTCG CTGTGGTTGT TTTAGCTCCT TGACTCCTTG TGGTTTATGT 1020 

CATCATACAT GACTCAGCAT ACCTGCTGGT GCAGAGCTCA AGATTTTGGA GGGTCCTCCA 1080 

CAATAAGGTC AATGCCAGAG ACGGAAGCCT TTTTCCCCAA AGTCTTAAAA TAACTTATAT 1140 

CATCAGCATA CCTTTATTGT GATCTATCAA TAGTCAAGAA AAATTATTGT ATAAGATTAG 1200 
AATGAAAATT GTATGTTAAG TTACTTCCTT TAG 



1 11 21 31 41 51 

I I I I I I 

MANCEFSPVS GDKPCCRLSR RAQLCLGVSI LVLILWVLA VWPRWRQTW SGPGTTKRFP 
ETV1ARCVKY TETHPEMRHV DCQSVWDAFK GAFISKHPCN ITEEDYQPLM KLGTQTVPCH 
45 KILLWSRIKD LAHQFTQVQR DMFTLEDTLL GYLADDLTWC GEFNTSKINY QSCPDWRKDC 
SNNPVSVFWK TVSRRFAEAA CDWHVMLNG SRSKIFDKNS TFGSVEVHNL QPEKVQTLEA 
WVTHGGREDS RDLCQDPTIK ELESIISKRN 1QFSCKNIYR PDKFLQCVKN PEDSSCTSEI 

SEQ ID N0:37 PBH1 DNA SEQUENCE 

50 Nucleic Acid Accession*: XML017718 

Coding sequence: 1-3315 (underlined sequences correspond lo start and stop codons) 

1 11 21 31 41 51 

- c I I I I I i 

55 ATGTCCTTTC GGGCAGCCAG GCTCAGCATG AGGAACAGAA 
ACCCGGACCC TGTACTCCAG CG 
TTGGTGAATT TTATTCAAGC AAATTTTAAG AAACGAGAAI GTGTCTTCTT TACCAAAGAT 
TCCAAGGCCA CGGAGAATGT GTGCAAGTGT GGCTATGCCC AGAGCCAGCA CATGGAAGGC 
ffx ACCCAGATCA ACCAAAGTGA GAAATGGAAC TACAAGAAAC ACACCAAGGA ATTTCCTACC 
OU GACGCCTTTG GGGATATTCA GTTTGAGACA CTGGGGAAGA AAGGGAAGTA TATACGTCTG 
TCCTGCGACA CGG ACGCGG A AATCCTTTAC GAGCTGCTGA CCCAGCACTG GCACCTGAKA 
ACACCCAACC TGGTCATTTC TGTGACCGGG GGCGCCAAGA ACTTCGCCCT GAAGCCGCGC 
ATGCGCAAGA TCTTCAGCCG GCTCATCTAC ATCGCGCAGT CCAAAGGTGC TTGGATTCTC 
,_ ACGGGAGGCA CCCATTATGG CCTGATGAAG TACATCGGGG AGGTGGTGAG AGATAACACC 
05 ATCAGCAGGA GTTCAGAGGA GAATATTGTG GCCATTGGCA TAGCAGCTTG GGGCATGGTC 
TCCAACCGGG ACACCCTCAT CAGGAATTGC GATGCTGAGG GCTATTTTTT AGCCCAGTAC 
CTTATGGATG ACTTCACAAG AGATCCACTG TATATCCTGG ACAACAACCA CACACATTTG 

„ n CTAGAGAAGT ATATCTCTGA GCGCACTATT CAAGATTCCA ACTATGGTGG CAAGATCCCC 
70 ATTGTGTGTT TTGCCCAAGG AGGTGGAAAA GAGACTTTGA AAGCCATCAA TACCTCCATC 
AAAAATAAAA TTCCTTGTGT GGTGGTGGAA GGCTCGGGCC AGATCGCTGA TGTGATCGCT 
AGCCTGGTGG AGGTGGAGGA TGCCCTGACA TCTTCTGCCG TCAAGGAGAA GCTGGTGCGC 
TTTTTACCCC GCACGGTGTC CCGGCTGCCT GAGGAGGAGA CTGAGAGTTG GATCAAATCG 
_ _ CTCAAAGAAA TTCTCGAATG TTCTCACCTA TTAACAGTTA TTAAAATGGA AGAAGCTGGG 
75 GATGAAATTG TGAGCAATGC CATCTCCTAC GCTCTATACA AAGCCTTCAG CACCAGTGAG 
CAAGACAAGG ATAACTGGAA TGGGCAGCTG AAGCTTCTGC TGGAGTGGAA CCAGCTGGAC 
TTAGCCAATG ATGAGATTTT CACCAATGAC CGCCGATGGG AGTCTGCTGA CCTTCAAGAA 
GTCATGTTTA CGGCTCTCAT AAAGGACAGA CCCAAGTTTG TCCGCCTCTT TCTGGAGAAT 
orv GGCTTGAACC TACGGAAGTT TCTCACCCAT GATGTCCTCA CTGAACTCTT CTCCAACCAC 
80 TTCAGCACGC TTGTGTACCG GAATCTGCAG ATCGCCAAGA ATTCCTATAA TGATGCCCTC 
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CTCACGTTTG TCTGGAAACT GGTTGCGAAC TTCCGAAGAG GCTTCCGGAA GGAAGACAGA 1620 
AATGGCCGGG ACGAGATGGA CATAGAACTC CACGACGTGT CTCCTATTAC T 
CTGCAAGCTC TCTTCATCTG GGCCATTCTT CAGAATAAGA AGGAACTCTC C 
TGGGAGCAGA CCAGGGGCTG CACTCTGGCA GCCCTGGGAG C 
CTGGCCAAAG TGAAGAACGA CATCAATGCT GCTGGGGAGT O 
TACGAGACCC GGGCTGTTGA GCTGTTCACT GAGTGTTACA GCAGCGATGA A 

GAACAGCTGC TGGTCTATTC CTGTGAAGCT TGGGGTGGAA GCAACTGTCT GGAGCTGGCG 1980 

GTGGAGGCCA CAGACCAGCA TTTCATCGCC CAGCCTGGGG TCCAGAATTT TCTTTCTAAG 2010 

CAATGGTATG GAGAGATTTC CCGAGACACC AAGAACTGGA AGATTATCCT GTGTCTGTTT 2100 

ATTATACCCT TGGTGGGCTG TGGCTTTGTA TCATTTAGGA AGAAACCTGT CGACAAGCAC 2160 

AAGAAGCTGC TTTGGTACTA TGTGGCGTTC TTCACCTCCC CCTTCGTGGT CTTCTCCTGG 2220 

AATGTGGTCT TCTACATCGC CTTCCTCCTG CTGTTTGCCT ACGTGCTGCT CATGGATTTC 2280 

CATTCGGTGC CACACCCCCC CGAGCTGGTC CTGTACTCGC TGGTCTTTGT CCTCTTCTGT 2340 

GATGAAGTGA GACAGTGGTA CGTAAATGGG GTGAATTATT TTACTGACCT GTGGAATGTG 2400 

ATGGACACGC TGGGGCTTTT TTACTTCATA GCAGGAATTG TATTTCGGCT CCACTCTTCT 2460 

AATAAAAGCT CTTTGTATTC TGGACGAGTC ATTTTCTGTC TGGACTACAT TATTTTCACT 252 0 

CTAAGATTGA TCCACATTTT TACTGTAAGC AGAAACTTAG GACCCAAGAT TATAATGCTG 2580 

CAGAGGATGC TGATCGATGT GTTCTTCTTC CTGTTCCTCT TTGCGGTOTG GATGGTGGCC 2640 

TTTGGCGTGG CCAGGCAAGG GATCCTTAGG CAGAATGAGC AGCGCTGGAG GTGGATATTC 2700 

CGTTCGGTCA TCTACGAGCC CTACCTGGCC ATGTTCGGCC AGGTGCCCAG TGACGTGGAT 2760 

GGTACCACGT ATGACTTTGC CCACTGCACC TTCACTGGGA ATGAGTCCAA GCCACTGTGT 2820 

GTGGAGCTGG ATGAGCACAA CCTGCCCCGG TTCCCCGAGT GGATCACCAT CCCCCTGGTG 2880 

TGCATCTACA TGTTATCCAC CAACATCCTG CTGGTCAACC TGCTGGTCGC CATGTTTGGC 2940 

TACACGGTGG GCACCGTCCA GGAGAACAAT GACCAGGTCT GGAAGTTCCA GAGGTACTTC 3000 

CTGGTGCAGG AGTACTGCAG CCGCCTCAAT ATCCCCTTCC CCTTCATCGT CTTCGCTTAC 3060 

TTCTACATGG TGGTGAAGAA GTGCTTCAAG TGTTGCTGCA AGGAGAAAAA CATGGAGTCT 3120 

TCTGTCTGCT GTTTCAAAAA TGAAGACAAT GAGACTCTGG CATGGGAGGG TGTCATGAAG 3180 

GAAAACTACC TTGTCAAGAT CAACACAAAA GCCAACGACA CCTCAGAGGA AATGAGGCAT 3240 

CGATTTAGAC AACTGGATAC AAAGCTTAAT GATCTCAAGG GTCTTCTGAA AGAGATTGCT 3300 
AATAAAATCA AATGA 



1 11 21 31 41 51 

I I I I I I 

MSFRAARLSM RHRRNDTLDS TRTLYSSASR STDLSYSESD LVNFIQANFK KRECVFFTKD 60 

C GYAQSQHMEG TQINQSEKWN YKKHTKEFPT DAFGDIQFET LGKKGKYIRL 120 

i ELLTQHWHLX TPNLVISVTG GAKNFALKPR MRKIFSRLIY IAQSKGAWIL 180 

TGGTHYGLMK YIGEWRDNT ISRSSEENIV AIGIAAWGMV SNRDTLIRNC DAEGYFLAQY 240 

LMDDFTRDPL YILDNNHTHL LLVDNGCHGH FTVEAKLRNQ LEKYISERTI QDSNYGGKIP 300 

IVCFAQGGGK ETLKAINTSI KHKIPCVWE GSGQIADVIA SLVEVEDALT SSAVKEKLVR 360 

FLPRTVSRLP EEETESWIKW LKEILECSHL LTVTKMEEAG DEIVSNAISY ALYKAFSTSE 420 

QDKDNWNGQL KLLLEWNQLD LANDEIFTND RRWESADLQE VMFTALIKDR PKFVRLFLEN 480 

GLNLRKFLTH DVLTELFSNH FSTLVYRNLQ IAKHSYNDAL LTFVWKLVAN FRRGFRKEDR S40 

NGRDEMDIEL HDVSPITRHP LQALFIWAIL QNKKELSKVI WEQTRGCTLA ALGASKLLKT 600 

IAKVKHDIHA AGESEELANE YETRAVELFT ECYSSDEDLA EQLLVYSCEA WGGSNCLELA 660 

VEATDQHFIA QPGVQNFLSK QWYGEISRDT KNWKIILCLF IIPLVGCGFV SFRKKPVDKH 720 

KKLLWYYVAF FTSPFWFSW NWFYIAFLL LFAYVLLMDF HSVPHPPELV LYSLVFVLFC 780 

DEVRQWYVNG VNYFTDLWMV MDTLGLFYFI AGIVFRLHSE NKSSLYSGRV IFCLDYIIFT 840 

LRLIHIFTVS RNLGPKIIML QRMLIDVFFF LFLFAVWMVA FGVARQGILR QNEQRWRWIF 900 

RSVIYEPYLA HFGQVPSDVD GTTYDFAHCT FTGNESKPLC VELDEHNLPR FPEWITIPLV 960 

CIYMLETNIL LVNLLVAMFG YTVGTVQENN DQVWKFQRYF LVQEYCSRtN IPFPF1VFAY 1020 

FYMWKKCFK CCCKEKNMES SVCCFKKEDN ETLAWEGVMK EHYLVKIWTK MTOTSEEKRH 1080 
RFRQLDTKLN DLKGLLKEIA NKIK 

SEQ ID NO:39 P8H3 DNASEQUENCE 

Nucleic Acid Accession*: XMJH1804 

Coding sequence: 1-558 (underlined sequences correspond to start and slopcodons) 



ATCCCTCGCC TGTTCTTGTT CCACCTGCTA GAATTCTGTT TACTACTGAA CCAATTTTCC 
- _ AGAGCAGTCG CGGCCAAATG GAAGGACGAT GTTATTAAAT TATGCGGCCG CGAATTAGTT 

65 CGCGCGCAGA TTCCCATTTG CGGCATGAGC ACCTGGAGCA AAAGGTCTCT GAGCCAGGAA 

GATGCTCCTC AGACACCTAG ACCAGTGGCA GAAATTGTAC CATCCTTCAT CAACAAAGAT 
ACAGAAACTA TAATTATCAT GTTGGAATTC ATTGCTAATT TGCCACCGGA GCTGAAGGCA 
GCCCTATCTG AGAGGCAACC ATCATTACCA GAGCTACAGC AGTATGTACC TGCATTAAAG 
_„ GATTCCAATC TTAGCTTTGA AGAATTTAAG AAACTTATTC GCAATAGGCA AAGTGAAGCC 

70 GCAGACAGCA ATCCTTCAGA ATTAAAATAC TTAGGCTTGG ATACTCATTC TCAAAAAAAG 

AGACGACCCT ACGTGGCACT GTTTGAGAAA TGTTGCCTAA TTGGTTGTAC CAAAAGG1CT 
CTTGCTAAAT ATTG CTGA 

SEP ID NO:40 PBH3 PRQTETN SEQUENCE 

75 Protein Accession*: NP_O0B842 
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SEQ ID N0:41 PBH5 DNA SEQUENCE 

Nucleic Acid Accession*: NM_005845 

Coding sequence: 1-3978 (undented sequences correspond to start and slop codons) 

1 11 21 31 41 51 

I I I I I I 

ATSCTGCCCG TGTACCAGGA GGTGAAGCCC AACCCGCTGC AGGACGCGAA CCTCTGCTCA 
CGCGTGTTCT TCTGGTGGCT CAATCCCTTG TTTAAAATTG GCCATAAACG GAGATTAGAG 
GAAGATGATA TGTATTCAGT GCTGCCAGAA GACCGCTCAC AGCACCTTGG AGAGGAGTTG 
CAAGGGTTCT GGGATAAAGA AGTTTTAAGA GCTGAGAATG ACGCACAGAA GCCTTCTTTA 
ACAAGAGCAA TCATAAAGTG TTACTGGAAA TCTTATTTAG TTTTGGGAAT TTTTACGTTA 
ATTGAGGAAA GTGCCAAAGT AATCCAGCCC ATATTTTTGG GAAAAATTAT TAATTATTTT 
GAAAATTATG ATCCCATGGA T 
CTGACTTTTT GCACGCTCAT TTTGGGTATA 
TGTGCTGGGA TGAGGTTACG A 
CTTAGTAACA TGGCCATGGG G 

GATGTGAACA AGTTTGATCA GGW3ACAGTG TTCTTACACT TCCTGTGGGC AGGACCACTG 
CAGGCGATCG CAGTGACTGC CCTACTCTGG ATGGAGATAG GAATATCGTG CCTTGCTGGG 
ATGGCAGTTC TAATCATTCT CCTGCCCTTG CAAAGCTGTT TTGGGAAGTT GTTCTCATCA 
CTGAGGAGTA AAACTGCAAC TTTCACGGAT GCCAGGATCA GGACCATGAA TGAAGTTATA 
ACTGGTATAA GGATAATAAA AATGTACGCC TGGGAAAAGT CATTTTCAAA T 



15 
20 
25 
30 
35 

GAGGATGGTG ATCTGACTGT GATAGGAGAT CGGGGAACCA CGCTGAGTGG AGGGCAGAAA 1620 
GCACGGGTAA ACCTTGCAAG AGCAGTGTAT CAAGATGCTG ACATCTATCT CCTGGACGAT 
CCTCTCAGTG CAGTAGATGC GGAAGTTAGC AGACACTTGT TCGAACTGTG TATTTGTCAA 
4U ATTTTGCATG AGAAGATCAC AATTTTAGTG ACTCATCAGT TGCAGTACCT CAAAGCTGCA 
AGTCAGATTC TGATATTGAA AGATGGTAAA ATGGTGCAGA AGGGGACTTA CACTGAGTTC 
CTAAAATCTG GTATAGATTT TGGCTCCCTT TTAAAGAAGG ATAATGAGGA AAGTGAACAA 

cctccagttc caggaactcc cacactaagg aatcgtacct tctcagagtc TTCGGTTTGG 

_ TCTCAACAAT CTTCTAGACC CTCCTTGAAA GATGGTGCTC TGGAGAGCCA AGATACAGAG 2040 

45 AATGTCCCAG TTACACTATC AGAGGAGAAC CGTTCTGAAG GAAAAGTTGG TTTTCAGGCC 21 00 

TATAAGAATT ACTTCAGAGC TGGTGCTCAC TGGATTGTCT TCATTTTCCT TATTCTCCTA 2160 

AACACTGCAG CTCAGGTTGC CTATGTGCTT CAAGATTGGT GGCTTTCATA CTGGGCAAAC 2220 

AAACAAAGTA TGCTAAATGT CACTGTAAAT GGAGGAGGAA ATGTAACCGA GAAGCTAGAT 2280 

CTTAACTGGT ACTTAGGAAT TTATTCAGGT TTAACTGTAG CTACCGTTCT TTTTGGCATA 2340 

50 GCAAGATCTC TATTGGTATT CTACGTCCTT GTTAACTC1T CACAAACITT GCACAACAAA 2400 

ATGTTTGAGT CAATTCTGAA AGCTCCGGTA TTATTCTTTG AT AG AAATC C AATAGGAAGA 2460 

ATTTTAAATC GTTTCTCCAA AGACATTGGA CACTTGGATG ATTTGCTGCC GCTGACGTTT 2520 

TTAGATTTCA TCCAGACATT GCTACAAGTG GTTGGTGTGG TCTCTGTGGC TGTGGCCGTG 2580 

2640 




GAAATACGAA 



3000 



GCTGAAGTTG AGAATATGAT GATCTCAGTA GAAAGGGTCA TTGAATACAC AGACCTTGAA 3060 

AAAGAAGCAC CTTGGGAATA TCAGAAACGC CCACCACCAG CCTGGCCCCA TGAAGGAGTG 3120 

ATAATCTTTG ACAATGTGAA CTTCATGTAC AGTCCAGGTG GGCCTCTGGT ACTGAAGCAT 3180 

CTGACAGCAC TCATTAAATC ACAAGAAAAG GTTGGCATTG TGGGAAGAAC CGGAGCTGGA 3240 

65 AAAAGTTCCC TCATCTCAGC CCTTTTTAGA TTGTCAGAAC CCGAAGGTAA AATTTGGATT 3300 

GATAAGATCT tgacaactga aattggactt cacgatttaa ggaagaaaat g 

CCTCAGGAAC CTGTTTTGTT CACTGGAACA ATGAGGAAAA ACCTGGATCC C 

CACACGGATG aggaactgtg gaatgcctta caagaggtac aacttaaaga aaccattgaa 34B0 

GATCTTCCTG GTAAAATGGA TACTGAATTA GCAGAATCAG GATCCAATTT TAGTGTTGGA 3540 

70 CAAAGACAAC TGGTGTGCCT TGCCAGGGCA ATTCTCAGGA AAAATCAGAT ATTGATTATT 36 00 

GATGAAGCGA CGGCAAATGT GGATCCAAGA ACTGATGAGT TAATACAAAA AAAAATCCGG 3660 

GAGAAATTTG CCCACTGCAC CGTGCTAACC ATTGCACACA GATTGAACAC CATTATTGAC 3720 

AGCGACAAGA TAATGGTTTT AGATTCAGGA AGACTGAAAG AATATGATGA GCCGTATGTT 3780 

TTGCTGCAAA ATAAAGAGAG CCTATTTTAC AAGATGGTGC AACAACTGGG CAAGGCAGAA 3840 

75 GCCGCTGCCC TCACTGAAAC AGCAAAACAG GTATACTTCA AAAGAAATTA TCCACATATT 3900 

GGTCACACTG ACCACATGGT TACAAACACT TCCAATGGAC AGCCCTCGAC CTTAACTATT 3960 
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SEP ID NO;42 PBH5 PROTEIN SEQUENCE 

Protein Accession*: NP.005836 

1 11 21 31 41 51 

I I I I I I 

MLPVYQEVKP NPLQDANLCS RVFFWWLNPL FKIGHKRRLE EDDMYSVLPE DRSQHLGEEL 60 

QGFWDKEVLR AENDAQKPSL TRAIIKCYWK SYLVLGIFTL IEESAKVIQP IFLGKIINYF 12 0 

ENYDPMDSVA LNTAYAYATV LTFCTLILAI LHHLYFYHVQ CAGMRLRVAK CHMIYRKALR 180 

LSNMAMGKTT TGQIUNLLSN DVHKFDQVTV FLHFLWAGPL QAIAVTALLW MEIGISCI.AG 240 

MAVLIILLPL QSCFGKLFSS LRSKTATFTD ARIRTMNBVI TGIRIIKMYA WEKSFSNLIT 300 

NLRKKEISKI LRSSCLRGMN LASFFSASKI IVFVTFTTYV LLGSVITASR VFVAVT L YGA 360 

VRLTVTLFFP SAIERVSEAI VSIRRIQTFL LLDEISQRNR QLPSDGKKMV HVQDFTAFWD 420 

KASETPTLQG LSFTVRPGEL LAWGPVGAG KSSLLSAVLG ELAPSHGLVS VHGRIAYVSQ 480 

QPWVFSGTLR SNILFGKKYE KERYEKVIKA CALKKDLQLL EDGDLTV1GD RGTTLSGGQK S40 

ARVNLARAVY QDADIYLLDD PLSAVDAEVS RHLFELCICQ ILHEKITILV THQLQYLKAA 600 

SQXIiILKDGK MVQKGTYTEF LKSGIDFGSL LKKDNEESEQ PPVPGTPTLR NRTFSESSVW 660 

SQQSSRPSLK DGALESQDTE NVPVTLSEEN RSEGKVGFQA YKNYFRAGAH WIVFIFLILL 720 

NTAAQVAYVL QDWWLSYWAN KQSMLNVTVN GGGNVTEKLD LNWYLGIYSG LTVATVLFGI 780 

ARSLLVFYVL WSSQTLHHK MFESILKAFV LFFDRNPIGR ILNRFSKDIG HLDDLLPLTF 840 

LDFIGTLLQV VGWSVAVAV IPWIAIPLVP LGIIFIFLRR YFLETSRDVK RLESTTRSPV 900 

FSHLSSSLQG LWTIRAYKAE ERCQELFDAH QDLHSEAWFL FLTTSRWFAV RLDAICAMFV 960 

IIVAFGSLIL AKTLDAGQVG LALSYALTLM GMFOWCVRQS AEVENMMISV ERVIEYTDLE 1020 

KEAPWEYQKR PPPAWPHEGV IIFDNVNFMY SPGGPLVLKH LTALIKSQEK VGIVGRTGAG 1080 

KSSLISALFR LSEPEGKIWI DKILTTEIGL HDLRKKMSII PQEPVLFTGT MRKNMPFNE 1140 

HTDEELWHAL QEVQLKETIE DLPGKMDTEL AESGSNFSVG QRQLVCLARA ILRKNQILII 1200 



ATGGGGAAAG TGTCCTGCTG TGGCATGAAA TAAATGAAAC AGAAAATGAT Gl 
CTAAGAACAT CCTTTGCTTT GCTCTTCCTT GGCCTCTTTG GGGTGCTGGG GGCAGCAACA 
k GAAATGAAGA AGGGAAAGCT GTGGACTGGT T 



30 SEQ ID NO:43 PBQ7 DNA SEQUENCE 

Nucleic Acid Accession*: NMJ21233 

Coding sequence: 34-1 1 1 9 (underlined sequences correspond to start and stop cote) 

35 
40 

TTACTGCTGT GGAACAGAGT TCAAGGGTTC TGGCTGATTC ATTCCATCCC T 
CCAATTCCGG AAGAAGGCTA TGATTATCCA CCCACAGGGA GACGAAATGG ACAAAGTGGC 

45 ATCTGCATAA CTTTCAAGTA CAACCAGTAT GAGGCAATAG ATTCTCAGCT CTTGGTCTGC 
AACCCCAACG TCTATAGCTG CTCCATCCCA GCCACCTTTC ACCAGGAGCT CATTCACATG 
CCCCAGCTGT GCACCAGGGC CAGCTCATCA GAGATTCCTG GCAGGCTCCT CACCACACTT 
CAGTCGGCCC AGGGACAAAA ATTCCTCCAT TTTGCAAAGT CGGATTCTTT TCTTGACGAC 
ATCTTTGCAG CCTGGATGGC TCAACGGCTO AAGACACACT TGTTAACAGA AACCTGGCAG 

50 CGAAAAAGAC AAGAGCTTCC TTCAAACTGC TCCCTTCCTT ACCATGTCTA CAATATAAAA 
GCAATTAAAT TATCACGACA CTCTTATTTC AGTTCTTATC AAGATCACGC C 

ATTTCCCAAA AGGGCACCAA AAATCGCTGG ACATGTATTG GAGACCTAAA TCGGAGTCCA 1020 




60 

GHTKGLLLWN RVQGFWLIHS IPQFPPIPEE GYDYPPTGRH NGQSGICITF KYNQYEAIDS 
QLLVCNPNVY SCSIPATFHQ ELIHMPQLCT RASSSEIPGR LLTTLQSAQG QKFLHFAKSD 
05 SFLDDIFAAW MAQRLKTHLL TETWQRKRQE LPSNCSLPYH VYNIKAIKLS RHSYFSSYQD 

HAKWCISQKG TKNRWTCIGD LNRSPHQAFR SGGFICTQNW QIYQAFQGLV LYYESCK 

SEQ ID N0:45 PCQ8 DNA SEQUENCE 

Nucleic Acid Accession*: XMJB0453 
70 Coding sequence: 89-1273 (underlined sequences correspond to start and stop codons) 



CATCCGCTAC ATGATAGGAG AGATTCAATA TGGAGGCAGA GTCACTGACG ACTATGATAA 180 

GAGATTGTTG AACACATTTG CTAAGGTTTG GTTCAGTGAA AATATGTTTG GACCAGATTT 240 

CAGTTTTTAC CAAGGATACA ATATTCCAAA ATGCAGCACA GTGGATAACT ATCTTCAGTA 300 

TATCCAGAGT TTGCCTGCCT ATGACAGCCC TGAGGTGTTT GGGCTGCACC CCAATGCTGA 360 



317 



WO 02/30268 



PCT/US01/32045 



CATCACCTAC CAGAGCAAGC 
CAAGGACACC TCTGGTGGAG 
TGATATGCTG GAGAAGCTGC 

GAAGATGGGG CCATTCCAGC CTATGAACAT TTTCCTCAGG CAGGAAATAG ACAGAATGCA 
AAGGGTACTC AGCCTTGTCC GCAGCACCCT CACTGAGCTG AAACTTGCTA TTGATGGCAC 
CATCATCATG AGCGAAAATC TGCAAGATGC ATTGGATTGC ATGTTTGATG CTAGAATCCC 
TGCTTGGTGG AAAAAAGCTT CTTGGGTTTT TAGTACACTG GGTTTCTGGT TTACTGAACT 
TATAGAAAGA AACAGCCAGT TTACCTCGTG GGTTTTCAAT GGCCGACCTC ACTGCTTTTG 
GATGACGGGT TTTTTTAACC CCCAGGGATT TTTAACTGCA ATGCGACAGG AAATAACTCG 
GGCCAACAAA GGCTGGGCTC TGGACAATAT GGTGCTTTGC AATGAAGTCA CCAAATGGAT 
GAAGGACGAC ATTTCTACCC CTCCCACAGA GGGTGTCTAT GTCTATGGCT TATATCTTGA 
AGGTGCTGGC TGGGACAAGA GGAACATGAA ACTCATTGAA TCAAAGCCAA AAGTGCTCTT 
TGAGTTGATG CCTGTCATAA GGATTTATGC . 
TTACTCCTGT CCCATCTATA 

GGATCTCAGG ACAGCCCAGA CCCCTGAACA CTGGGTGCTC CGTGGGGTTG CCCTTCTGTG 
TGATGTCAAG TAACATGTGG GGAGTGTCCC CACCCAATGC TTTGGAAAAT I 
AATTATTGTA ACCTTTATTT CTGTATGACT GCTGGACAGT GTATGTTAGG 1 
AATTAATGAG CTGCATAGGT TTTCCCCACT CCTTAATTGG ATGCTTATAT TTTACTTGTT 

ATTTACAGCA TCCTAATGAA GTGTGGCCCT CAAATCCACA GTAGTATATT TTCTTCTTAC 1560 
TTCGCTCCGA AGACTGACTG TGATTATAAC AGCAAATATA TTTGCATGTG GACAAAGATT 1620 
AGATGGCAAG ATAGAAAAAT AAGAACAGAT GTGATAGCAA GAATTATAGT TGGCTTGAAA : 
AAATGTGATG ATCAGGAGAA AAAATAAAAA AAGGGTAGAA ATATTAGACG GTGCGTAGGG 
ACTTTCTATG GACTTTTATT AATTAGGAAA CATTATCAAA GGAACTTTTC ACGTATTTTT 
CTTTAAATTC TGGTTAGATG TTATTAATAA TTCTTCATCT AACCTACTGA CTAGAAAATA 
TAGTCAGTAC TAAATTA3AA TTGTGGTTTA TAAACTTTTG GTTAGCTCTG GATCTGTATA 
ACTGCATTTT TTTGGATAAA CAGTTTTTGG TAGGTGGATA CCGGGAGACA AGTGTGGGTC . 
CCTCTCACTG GGCTTCATTC TGTGGACCAG GATCATTATT TCATGCTCAT GATCATGAGA 2040 
GTTAGGACTG AGTGGCTCCT GTGACTCCCA CCATCTTAGA TGATACTGTT TTCTTGTGAG 21D0 
TTCTTTCTTT TGGTCTGGAT TAGTATATCA GTTGATOTGT GTGAATTGTG GTGAAACAAT 2160 
CATTTCATTT TGAAAAGCAA GTAATGAAAA TGTCAGCATC ATAGGAATTA ATAAAATGTT 2220 
TTTACTAAAA AAAAAAAAAA AAA 



1 11 21 31 41 51 

I I I 1 I I 

MDVKKGVSWT TIRYMIGEIQ YGGRVTDDYD KRLLNTFAKV WFSENMFGPD FSFYQGYNIP 
KCSTVDNYLQ YIQSLPAYDS PEVFGLHPNA DITYQSKLAK DVLDTILGIQ PKDTSGGGDE 
TREAWARLA DDMLEKLPPD YVPFEVKERL QKMGPFQPMN IFLRQEIDRM QRVLSLVRST 
LTELKLAIDG TIIMSENLQD ALDCMFDARI PAWWKKASWV FSTLGFWFTE LIEPNSQFTS 
WVFNGRPHCF WMTGFFNPQG FLTAMRQEIT RANKGW* 
EGVYVYGLYL EGAGWDKRNM KLIESKPKVL FELMEVIRIY A 
VRTDLN YIAA VDLRTAQTPE HWVLRGVALL CDVK 

SEQ ID NO:47 PDG5 DNA SEQUENCE 

Nucleic Acid Accession #: AB033036 

Coding sequence: 68-3349 (underfilled sequences correspond lo start and stop codons) 



T ACAACTTCAC AACCAGAAAC CACTACCCCT CAGGGGTTGC TTTCAGATAA 60 

AGATGACATG GGAAGGAGAA ATGCTGGCAT AGATTTCGGA TCCAGAAAAG CATCAGCAGC 120 

ACAGCCCATA CCTGAAAACA TGGACAATTC CATGGTTAGT GATCCACAAC CATACCATGA . 180 

AGATGCAGCT TCTGGAGCTG AGAAGACAGA AGCCAGAGCT TCTCTCTCAC TGATGGTGGA 240 

AAGCCTTTCT ACAACCCAAG AGGAGGCCAT TCTCTCAGTA GCAGCAGAGG CTCAGGTGTT 300 

TATGAATCCT TCTCATATCC AGTTAGAAGA TCAAGAAGCT TTCAGCTTTG ATTTACAAAA 360 

GGCCCAATCC AAAATGGAGT CAGCCCAGGA TGTTCAAACT ATCTGCAAAG AAAAGCCTTC 420 

TGGAAATGTT CACCAGACCT TTACAGCAAG TGTTTTGGGT ATGACAAGTA CTACAGCCAA 480 

AGGAGATGTT TATGCCAAGA CTCTGCCTCC CAGAAGCCTT TTTCAGTCCT CAAGGAAGCC 540 

TGATGCTGAA GAAGTCTCCT CAGATTCAGA GAATATTCCT GAGGAGGGGG ATGGTTCTGA 600 

AGAACTGGCT CATGGTCACT CTTCCCAGTC CTTGGGGAAG TTTGAAGATG AACAAGAAGT 660 

CTTCTCAGAA TCAAAAAGTT TTGTTGAGGA CTTGAGCAGC TCTGAGGAGG AGCTGGACCT 720 

CAGATGCCTC TCCCAGGCTT TAGAGGAGCC TGAAGATGCA GAAGTCTTCA CAGAATCAAG 780 

CAGTTATGTT GAAAAGTACA ACACTTCTGA TGATTGCAGC AGCTCAGAGG AAGACCTGCC B40 

TCTCAGACAC CCTGCTCAGG CCTTGGGAAA GCCCAAAAAC CAACAAGAAG TCTCCTCTGC 900 

TTCAAATAAT ACTCCTGAAG AGCAGAATGA TTTTATGCAG CAGCTGCCTT CCAGATGCCC 960 

TTCTCAGCCC ATTATGAATC CTACTGTTCA GCAACAAGTC CCCACCAGTT CAGTGGGCAC 1020 

TTCTATAAAA CAGAGCGATT CCGTGGAGCC AATCCCTCCA AGACACCCTT TCCAGCCATG 1080 

GGTGAACCCT AAAGTGGAGC AAGAAGTTTC CTCATCTCCA AAGAGCATGG CTGTTGAAGA 1140 

GAGCATTTCT ATGAAGCCTC TGCCTCCTAA ACTTCTTTGC CAGCCCTTGA TGAATCCTAA 1200 

AGTTCAACAA AACATGTTCT CAGGTTCAGA GGACATTGCT GTTGAGAGAG TCATTTCTGT 1260 

GGAGCCACTA CTCCCCAGAT ATTCTCCTCA GTCCTTGACA GATCCTCAAA TCCGGCAAAT 1320 

CTCAGAAAGC ACAGCTGTTG AGGAAGGCAC TTATGTGGAA CCGCTGCCTC CCAGATGCCT 1380 

TTCCCAGCCC TCGGAGAGGC CTAAGTTCCT GGACTCAATG AGTACTTCTG CAGAATGGAG 1440 

CAGTCCTGTG GCACCAACAC CTTCCAAATA CACTTCCCCG CCATGGGTGA CCCCTAAATT 1500 

TGAGGAACTG TATCAACTCT CTGCACATCC AGAAAGCACT ACTCTTGAAG AGGACATTTC 1560 

TAAGGAGCAG CTGCTTCCCA GACATCTTTC CCAGTTGACT GTGGGAAATA AAGTCCAGCA 1620 

ACTGTCCTCA AATTTCGAGC GGGCTGCTAT TGAGGCAGAC ATTTCTGGGA GTCCATTGCC 1680 
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TCCCCAATAT GCTACCCAGT TCTTAAAGAG GTCTAAAGTT CAGSAAATGA CCTCACGACT 1740 

AGAGAAAATG GCTGTTGAAG GCACTTCTAA CAAATCACCG ATTCCCAGGC GTCCGACCCA 1800 

GTCATTCGTG AAATTTATGG CACAGCAAAT CTTTTCAGAG AGCTCTGCTC TTAAGAGGGG 1860 

CAGTGATGTG GCACCTCTGC CTCCCAATCT TCCTTCCAAA TCTTTATCAA AGCCTGAAGT 1920 

CAAGCACCAA GTTTTCTCAG ATTCAGGGAG TGCTAATCCT AAGGGAGGCA TTTCTTCAAA 1980 

GATGCTACCT ATGAAGCACC CTTTACAGTC CTTGGGGAG3 CCTGAAGACC CACAGAAAGT 2040 

TTTCTCTTAT TCAGAGAGAG CTCCTGGGAA GTGCAGCAGT TTTAAAGAGC AGCTGTCTCC 2100 

CAGGCAGCTT TCCCAGGCCT TGAGGAAACC TGAGTATGAG CAAAAAGTCT CCCCTGTTTC 2160 

TGCCAGTTCT CCTAAAGAGT GGAGGAATTC TAAAAAGCAG CKCCTCCCA AACATTCTTC 2220 

CCAAGCCTCA GATAGGTCTA AATTCCAGCC ACAGATGTCA TCAAAGGGCC CAGTGAATGT 2280 

ACCTGTAAAG CAGAGCAGCG GTGAGAAGCA CCTGCCTTCA AGTAGTCCTT TCCAGCAACA 2340 

GGTTCATTCA AGTTCTGTGA ATGCTGCTGC TAGGCGATCT GTTTTTGAGA GCAATTCTGA 2400 

CAATTGGTTC CTAGGAAGAG ATGAAGCTTT TGCAATCAAA ACCAAGAAAT TCAGCCAAGG 2460 

TTCCAAAAAC CCCATAAAGA GCATTCCAGC CCCTGCTACC AAACCTGGGA AGTTCACCAT 2520 

C AGGCAAACAT CCACTTCTGG GGGCATTTAC TCTAAGAAAG AAGATCTTGA 2580 

^ATAATA ACCAGCATGC AAACCTATCC AATCAGGATG ATGTTGAAAA 2640 

A GTTCGACTGA AAAGAGCCCC TCCCTCGCAG AAGTATAAGA GTGAGAAACA 2700 

AGATAACTTC ACCCAGCTTG CTTCAGTGCC CTCGGGCCCA ATTTCATCCT CTCTAGGCAG 2760 

GGGACATAAA ATCAGAAGCA CTTCCCAGGG GCTCCTGGAT GCTGCAGGGA ACCTCACCAA 2820 

AATATCTTAC GTTGCAGATA AGCAACAGAG CAGGCCCAAA TCTGAAAGCA TGGCCAAGAA 2880 

GCAACCTGCT TGCAAGACCC CAGGAAAGCC TGCTGGTCAA CAGTCAGATT ATGCTGTCTC 2940 

AGAGCCGGTT TGGATAACTA TGGCAAAGCA GAAGCAGAAG AGTTTCAAGG CCCACATTTC 3000 

TGTGAAAGAG CTGAAAACTA AGAGCAATGC TGGAGCCGAT GCTGAGACTA AGGAGCCTAA 3060 

ATATGAGGGA GCTGGCTCTG CAAATGAAAA CCAACCTAAA AAGATGTTCA CTTCCAGTGT 3120 

CCATAAACAG GAGAAGACAG CACAGATGAA GCCACCTAAG CCTACAAAAT CAGTTGGATT 3180 

TGAAGCTCAG AAGATACTGC AAGTTCCTGC CATGGAAAAA GAAACCAAAC GATCTTCAAC 3240 

TCTCCCAGCC AAGTTCCAGA ACCCAGTTGA GCCAATTGAG CCTGTCTGGT TCTCACTGGC 3300 

CAGGAAGAAA GCCAAAGCAT GGAGCCACAT GGCAGAAATC ACGCAATAAA GAGCTCTTGT 3360 

GTGGAGCATC AGCATTTATT TTATTTAGTT TTTTTTTTTT TTTTTTTTTT GAGACAGAGT 3420 

CTCGCTCTGT TACCCAGATT GGAGTGCAGT GGCGCGATCT CCGCTCACTG CAAGCTCCGC 3480 

CTCCCGGGTT CACGCCACTC TCCCGCCTCA GTCTCCCGAC TAGCTGGGAC TACAGGCGCC 3540 

CGCCATCACG CCCGGCTAAT TTTGTTTTCG TATTTTTAGT AGAGACGGGG TTTCACCATG 3600 

TTGGCCAGGA TGGTCTTGAT CTCGTGACCT CGTGATCCGC CCGCCTCAGC CTCCCAAAAG 3660 

CTGGGATTAC AGGCGTGAGC CACCGCGCCC GGCCAAGCAT CAGCGTTTTA AATGATAATT 3720 

GCTAATAGCT GTATTAATTC TATGTAGTGA TCTTTTTACT GTGACCACTT GTATTAAGCA 3780 
AAATAAGTAT TAAGCAAACT AAGAATTTAT TAAGCAAAAT AAGAATTTAT TAAGCAAAAT 
AGCCTTAGAA ATGCAAATTA AAACATAATT ATTTGAATGA AATAAATGCC ATGAATGCTT 



AAGGGAGAAA GCAAAAACAT TTTGTTGGAG CAACTAGAAA ATTGTCATTT CCCTCAACCA 4080 

AATAAAGTAA TTCTAATGGA AACATTCAGA TGATTTGACC TAAAGATTGG CCTTTAGGTT 4140 

TTATGAGCCT AGATAGATGC CGCAATTATT TGGTTGTTGC TCTAAGCTTT GCAAGGGATC 4200 

CTAAAAGAGG CGGTGGAAGT GAAAATTCTG GGTCTCCAAG AAAATTTCTG CACAGCCAGT 4260 

TCTCCAATCA GCCTATCACC CCTTGAAACA TCTTCCCTGT GTCCCTGGGG GCCCCTGATG 4320 

CTTTCTCCTT GGGTGATAGT AACATGCAGA GCACTTACAC AAAGCTCCCT CTTTGGACAT 4380 

ACCCCACGTC GACCTGTCAC AGGCCTGGCT GTAGCGAGCA CCTCCCTATG ACGCAGAATG 4440 

CTTCTTGGGA ATTATCTTAC TCCTCTGGAG GGTTAGTCCA TCAATGTTTT GCTTCTTGTC 4500 

CCAATACTAC TGTGACCCTC TCTGATCGCA CAGAAATCAC TGCCTATCAC ATATATCCTG 4560 

TTAAGCACTG AAGACCCTAT TGAAATTAGA GTTCTACAGA TGCCAAAAGC TGTACTTTCC 4620 

ATCAGGCAGA TGGCAAGCTT ACTGCCTTGA TGCACATCTG GAGCCACTGG AGCTCCWCC 4680 

TCTCTGGTTC CAGCATTAAG GTGGAGAACT CCATGTAGCT TCTTGTCCTT TCCCCTCAGC 4740 

TGTCTTTGCT TCACAAGGTT TTAGCCCAAA GCAAGAGTGC AATCCCAAAG CCACAGAGAA 4800 

C CGCTACCTGG AAGCTTTAAG TGAGTAAATC AGCTTTTCCC CTCTCATTCC 4B60 

C TGGAGAGACC CTACCTTCCA GTGACCCACT 4920 

5040 
5100 

G ATGTAGGCAC CTTTACCCCC AGAGCCAGTT CCTTAATTGG CTGGCTTTOT 5160 

GAGATGCAGA CCACCCTAGA ATCTCATCTA GGTTCACTAG AAGTTAGTTA AATCTTCCTT 5220 

TCTCTGTCTT TCTCTTCATT CCATCCCCCA AACCCACCAA ACACTAAGGG AGAGCTCCCT 5280 

TTGGATGTCT GGGCAGTAAA CCTAGCTCAT TTTTCTAGGA GACCCAGAAG TGACTTCTGA 5340 

GTAGTTATCA CTGTGTCTGC CTCTGTTACA CTGTGCTGCT TTGCTTAAAC AGAAATGCAG 5400 

GCCTGGACAT CTGACTGTGC CTTTATATTC TGAGTGGGGT GCT3CCCCAT GCAAAAAAAT 5460 

CCAGAGAGGT AGTGAGGTGT CAGAGCTAAA CACTTGGTGC TGGGTTTTGT TGATGCTGGT 5520 

ATAATGTGAC ACAGTACAAT TACATGCTAA ATTTTGCATT TTCTCTATAT AACATCTATT 5580 

TTTCCTGATA CTGTGCCTTT GCCATTTTGA TAATGCTATT TTGATTGAGT GAATTTTATT 5640 

TCCTTTGTAT TCCCATAGTG AACAATATAT TAAGGTAGAT GCCCTTTATC TGGGTACTCC 5700 

TGGTAGATTA GCTGTTACAC CTCCCTTCCC TTTTTTACAG TGAACCTGTA TTCAGTTATT 5760 

GTCACTCTGA GAACTCTCCA ATAACAATTT CTTTTCCACA GTTAACAACA CAGCTGTTAC 5820 

ACCTCCCTTC CTTTTTTTCA CAGTGAACCT GTATTCAGCT ATTCTCACTC TGAGAACTCT S880 

CCAATAACAA TTTCTTTTCC ACAGTTAACA ACAAAGTTCT GTTTTTAAAT GAAGAGATTA 5940 

AGTTCTTTTT AAATGCCTAA AGGCATATTC TGACAACTTT TCTACTTCTT TAACTTTTTT 6000 
GATTTAAGAT ATATGCAAAG CAAATAAATT CAATAAAGCC T 
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DAASGAEKTE ARASLSLMVE SLSTTQEEAI LSVAAEAQVF MNPSHIQLED QEAFSFDLOK 
AQSKMESAQD VQTICKEKPS GNVHQTFTAS VLGMTSTTAK GDVYAKTLPP RSLFQSSRKP 
DAEEVSSDSE NIPEEGDGSE ELAHGHSSQS LGKFEDEQEV FSESKSFVED LSSSEEELDL 
RCLSQALEEP EDAEVFTESS SYVEKYNTSD DCSSSEEDLP LRHPAQALGK PKNQQEVSSA 
O SNNTPEEQND FMQQLPSRCP SQPIMHPTVQ QQVPTSSVGT SIKQSDSVEP IPPRHPFQPW 
VNPKVEQEVS SSPKSMAVEE SISMKPLPPK LLCQPLMNPK VQQNMFSGSE DIAVERVISV 
EPLLPRYSPQ SLTDPQIRQI SESTAVEEGT YVEPLPPRCL SQPSERPKFL DSMSTSAEWS 
SPVAPTPSKY TSPPWVTPKF EELYQLSAHP ESTTVEEDIS KEQLLPRHLS QLTVGNKVQQ 
LSSNFERAAI EADISGSPLP PQYATQFLKR SKVQEMTSRL EKMAVEGTSN KSPIPRRPTQ 

10 SFVKFMAQQI FSESSALKRG SDVAPLPPNL FSKSLSKPEV KHQVFSDSGS ANPKGGISSK 
MLPMKHPLQS LGRPEDPQKV FSYSERAPGK CSSFKEQLSP RQLSQALRKP EYEQKVSPVS 
ASSPKEWRNS KKQLPPKHSS QASDRSKPQP QMSSKGPVHV FVKOSSGEKH LPSSSPFQQQ 
VHSSSVNAAA RRSVFESNSD NWFLGRDEAF AIKTKKFSOG SKNPIKSIPA PATKPGKFTI 
_ APVRQTSTSG GIYSKKEDLE SGDGNNNQHA NLSNQDDVEK LFGVRLKRAP PSQKYKSEKQ 

15 DNFTQLASVP SGPISSSVGR GHKIRSTSQG LLDAAGNLTK ISYVADKQQS RPKSESMAKK 

QPACKTPGKP AGQQSDYAVS EPWITMAKQ KQKSFKAHIS VKELKTKSNA GADAETKEPK 1 
YEGAGSANEN QPKKMFTESV HKQEKTAQMK PPKPTKSVGF EAQKILQVPA MEKETKRSST 1 
LPAKFQNPVE PIEPVWFSLA RKKAKAWSHM AEITQ 

20 SEQ ID HO:49 PAB7 DNA SEQUENCE 

Nucleic Acid Accession* D87742 

Coding sequence: 203-3582 (underlined sequences correspond to start and stop codons) 

25 j i 1 f i 1 i 1 i 1 

GCTTTCCTTT CTAAAGTAGA AGAGGATGAT TATCCCTCTG AAGAACTACT AGAGGATGAA 
AACGCTATAA ATGCAAAACG GTCTAAAGAA AAAAACCCTG GGAATCAGGG CAGGCAGTTT 
GATGTTAATC TGCAAGTCCC TGACAGAGCA GTTTTAGGGA CCATTCATCC AGATCCAGAA 
ATTCAAGAAA GCAAGCAAGA AACTAGTATG ATTTTGGATA GTGAAAAAAC AAGTGAGACT 
30 GCTGCCAAAG ■ GGGTCAACAC AGGAGGCAGG GAACCAAATA CAATGGTGGA AAAAGAACGC 
CCTCTGGCAG ATAAGAAAGC ACAGAGACCA TTTGAACGAA GTGACTTTTC TGACAGCATA 
AAAATTCAGA CTCCAGAATT AGGTGAAGTG TTTCAGAATA AAGATTCTGA TTATCTGAAG 



CTCTCAAAAG AGGACCATGG GAACACAGAG A 
GCTGCTGCAG AACCIGAAGA TGACTCGTTC CACTGGACTC CACATACAAG TGTAGAGCCA 
GGGCATAGTG ACAAGAGGGA GGACTTACTT ATCATAAGCA GCTTCTTTAA AGAACAACAG 
TCTTTGCAGC GGTTCCAGAA GTACTTTAAT GTCCATGAGC TGGAAGCCTT GCTACAAGAA 
ATGTCATCAA AACTGAAGTC AGCGCAGCAG GAGAGCCTGC CCTATAATAT GGAAAAAGTC 
CTAGATAAGG TCTTCCGTGC TTCTGAGTCA CAAATTCTGA GCATAGCAGA AAAAATGCTT 
G GGAATGAACG AAAATAACAT ATTTGAAGAG 



GCAGAGGAGA CAGCCACACT GGTGATGGCA CCACCTCTAG AGGAAGGCTT GGGTGGAGCA 
ATGGAAGAGA TGCAACCACT GCATGAAGAT AATTTCTCAC GAGAGAAGAC AGCAGAACTT 
AATGTGCAGG TTCCTGAAGA ACCCACCCAC TTGGACCAAC GTGTGATTGG GGACACTCAT 
GCCTCAGAAG TGTCACAGAA GCCAAATACT GAGAAAGACC TGGACCCAGG GCCAGTTACA 
ACAGAAGACA CTCCTATGGA TGCTATTGAT GCAAACAAGC AACCAGAGAC AGCCGCCGAA 
GAGCCGGCAA GTGTCACACC TTTGGAAAAC GCAATCCTTC TAATATATTC ATTCATGTTT 
TATTTAACTA AGTCGCTAGT TGCTACATTG CCTGATGATG TTCAGCCTGG GCCTGATTTT 
TATGGACTGC CATGGAAACC TGTATTTATC ACTGCCTTCT TGGGAATTGC TTCGTTTGCC 1440 
ATTTTCTTAT GGAGAACTGT CCTTGTTGTG AAGGATAGAG TATATCAAGT CACGGAACAG 
CAAATTTCTG 'AGAAGTTGAA GACTATCATG AAAGAAAATA CAGAACTTCT ACAAAAATTG 
TCAAATTATC AACAGAAGAT CAAGGAATCA AAGAAACATG TTCAGGAAAC CAGGAAACAA 
AATATGATTC TCTCTGATGA AGCAATTAAA TATAAGGATA AAATCAAGAC ACTTGAAAAA 
AATCAGGAAA TTCTGGATGA CACAGCTAAA AATCTTCGTG TTATGCTAGA ATCTGAGAGA 
GAACAGAATG TCAAGAATCA GGACTTGATA TCAGAAAACA AGAAATCTAT AGAGAAGTTA 
AAGGATGTTA TTTCAATGAA TGCCTCAGAA TTTTCAGAGG TTCAGATTGC ACTTAATGAA 
A GTGAAGAGAA GGTGAAGTCT GAATGCCATC GGGTTCAAGA AGAAAATGCT 
ft AGAAAAAAGA GCAGTTGCAG CAGGAAATCG AAGACTGGAG TAAATTACAT 
GCTGAGCTCA GTGAGCAAAT CAAATCATTT GAGAAGTCTC AGAAAGATTT GGAAGTAGCT 
CTTACTCACA AGGATGATAA TATTAATGCT TTGACTAACT GCATTACACA GTTGAATCTG 
TTAGAGTGTG AATCTGAATC TGAGGGTCAA AATAAAGGTG GAAATGATTC AGATGAATTO. 2160 
GCAAATGGAG AAGTGGGAGG TGACCGGAAT GAGAAGATGA AAAATCAAAT TAAGCAGATG 2220 
ATGGATGTCT CTCGGACACA GACT3CAATA TCGGTAGTTG AAGAGGATCT AAAGCITTTA 2280 
CAGCTTAAGC TAAGAGCCTC CGTGTCCACT AAATGTAACC TGGAAGACCA GGTAAftGAAA 2340 
TTGGAAGATG ACCGCAACTC ACTACAAGCT GCCAAAGCTG GACTGGAAGA TGAATGCAAA 2400 
ACCTTGAGGC AGAAAGTGGA GATTCTGAAT GAGCTCTATC AGCAGAAGGA GATGGCTTTG 2460 
CAAAAGAAAC TGAGTCAAGA AGAGTATGAA CGGCAAGAAA GAGAGCACAG GCTGTCAGCT 2520 
GCAGATGAAA AGGCAGTTTC GGCTGCAGAG GAAGTAAAAA CTTACAAGCG GAGAATTGAA 2S80 
GAAATGGAGG ATGAATTACA GAAGACAGAG CGGTCATTTA AAAACCAGAT CGCTACCCAT 2640 
GAGAAGAAAG CTCATGAAAA CTGGCTCAAA GCTCGTGCTG CAGAAAGAGC TATAGCTGAA 2700 
GAGAAAAGGG AAGCTGCCAA TTTGAGACAC AAATTATTAG AATTAACACA AAAGATGGCA 2760 , 
ATGCTGCAAG AAGAACCTGT GATTGTAAAA CCAATGCCAG GAAAACCAAA TACACAAAAC 2820 
CCTCCACGGA GAGGTCCTCT GAGCCAGAAT GGCTCTTTTG GCCCATCCCC TGTGAGTGGT 2880 
GGAGAATGCT CCCCTCCATT GACAGTGGAG CCACCCGTGA GACCTCTCTC TGCTACTCTC 2940 
AATCGAAGAG ATATGCCTAG AAGTGAATTT GGATCAGTGG ACGGGCCTCT ACCTCATCCT 3000 
CGATGGTCAG CTGAGGCATC TGGGAAACCC TCTCCTTCTG ATCCAGGATC TGGTACAGCT 3060 
ACCATGATGA ACAGCAGCTC AAGAGGCTCT TCCCCTACCA GGGTACTCGA TGAAGGCAAG 3120 
GTTAATATGG CTCCAAAAGG GCCCCCTCCT TTCCCAGGAG TCCCTCTCAT GAGCACCCCC 3180 
ATGGGAGGCC CTGTACCACC ACCCATTCGA TATGGACCAC CACCTCAGCT CTGCGGACCT 3240 
TTTGGGCCTC GGCCACTTCC TCCACCCTTT GGCCCTGGTA TGCGTCCACC ACTAGGCTTA 3300 
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AGAGAATTTG CACCAGGCGT TCCACCAGGA AGACGGGACC TGCCTCTCCA CCCTCGGGGA 3360 

TTTTTACCTG GACACGCACC ATTTAGACCT TTAGGTTCAC TTGGCCCAAG AGAGTACTTT 3420 

ATTCCTGGTA CCCGATTACC ACCCCCAACC CATGGTCCCC AGGAATACCC ACCACCACCT 3480 

GCTGTAAGAG ACTTACTGCC GTCAGGCTCT AGAGATGAGC CTCCACCTGC CTCTCAGAGC 3540 

ACTAGCCAGG ACTGTTCACA GGCTTTAAAA CAGAGCCC AT AAA ACTATGA CCTCTGAGGT 3600 

TTCATTGGAA AGAAAGTGTA CTGTGCATTA TCCATTACAG TAAAGGATTT CATTGGCTTC 3660 

AAAATCCAAA AGTTTATTTT AAAAGGTTTG TTGTTAGAAC TAAGCTGCCT TGGCAGTGTG 3720 

CATTTTTGAG CCAAACAATT CAAAAATGTC ATTTCTTCCC TAAATAAAAA TCACCTTTTA 3780 

AGCTAGAGCG TCCTTACAAC TTTGAAATGT GCAATAAAGA ATACCTGTGT TTTAGCTAAT 3B40 

GTAGCATATG TAATTGCAAA ATGATTTAGA ATGTCATGAA AAATATGAAC ATTTCCTGTG 3900 

GAAATGCTTT AAGAACATGT ATTTCCATTA TCCTATTTTT AGTGTACACC AGCTGAATAC 3960 

GGAGCAATGG TGTTTATAAG CGTTTTTTTA AACTATCTGG TCACAAAGAC TGTTACGCTA 4020 

AAAATGTTTA CTAAAAGATC ACTAAACTAT CTCCCCTCTT GCTGAAGTTC TTTGTAGTAA 4080 

TAGCTCATAA AAATTTGTTT ATTAATATTT CCCAAGTGTC TGTTGACTCA TTGGACTGTT 4140 

ATGAGGCTTG TGCCATTTGG GGAACATGTA AACTCAGGCT CCCAGAACTG AAGATGGTGG 4200 

CTGGTGGCAC ACTTCCGGCT GCTCCTCCGT CACCTGTGAA CTCTACAAGT GATGTCTTTT 4260 

TATTTCAAAG AAGTTTATTT CCCACTTGTA TAGCATTCAC ATGCTTTCTT TACGATCCTC 4320 

ATTGTCTATT TGAGAATGGT TTTCTGAGAG TGAGTTTACA TTAGTAGCAA GAGTTGTTTG 4380 

ACCTGATGTT CCATTGTTTT TACCATTCCT GTAGAAAAAG GGTGCACAAC AGAAAAATGA 4440 

AAATGATGTG TCATGGCCAT AAAAGTATAG AAATCTTTAA AAATTTTAAA ATGTACAGTC 4500 

CCTTATCTAT CTTTCCCATT CCTTGCCACT GATTTTTGAG GAATATAATA AAAAGATTGG 4560 

AAGAGTATAA TGCCATGAGA AAGAATGATT TAGGACTGTG AGGGTTATAA CATGCCCTAG 4620 

GTCAGCAACC AAGGGTTGAA ATCAGTTCTG TTTTAGGGGG AAATGGGGGG GGCGACAGAT 4680 

ATTATTCCAA AATTAATATT AATTAATATT TAAACGTTGG TGTTTTTATT TAAAAATCAG 4740 

TAACTAACCA TCTGGAATTG CACCATACTT AAAGTCTTAT CCATTACTAC ACTGTCTTTA 4800 

AAACAATGTT TCTTTAAATA CTCTACAACG TTTCTAAGAA CGAACTTCAG ACATTTTAAT 4860 
TACAGTAATA ATAGCACTCC T 




A AGAATAGAAG A 

5340 

CTCTTTAGGA CACAAAACAA TGCTGAAGTT AATATAATTT CTAATTTTAA ATGTCATTTA 5400 

AGTGTAGATT ATGCCATCTA GGAAGGTAAG TAGGAAAGGT AAATTAAATC TATTTTTAAA 5460 

T TTTCCCCTCT AAAGCCTTTT TTGGTGATTA TTCTGTATCT 5520 
5580 

GAGCCAGTCC ATAACTGCTT CCTCACATCC ATCTGATTGC ACCATTTCTG CAGCAAACCC 5640 

CAAAGCAGGG TGCCAATATG CAGATGGCAT AGGGAGTATC ATCCCTCAGC CAAATCACTT 5700 

TTCCATCTCT AAAGTTTCAT CTATTTTGGA AGTCATCTCC AACTAATTGT GTCTGGATTT 5760 

AGTTGCTAAA ATTGTCTTAT TTATTTATGA AGCAGCAATA TTCAGCCTGA AAGCATTTCT 5820 

GCCATAGTTG TTGTAGTTAT ATCGCCAATG GCTGATTTTT TTCATTGGAA AGTAAATTTA 5880 

AGTAATTCGT GGGATGTGGT ATATTCTGTG TCAACTTCAA GATAATCACT CATTTTCTCG 5940 
TTATATTCAG GTCTGAATTA AAGTTAAGTT AATCAC 

SEP ID NO:50PAB7 Protein secuence 



1 11 21 31 41 51 

I I I I I I 

AFI.SKVEEDD YPSEELLEDE NAINAKRSKE KNPGNQGRQF DVNLQVFDRA VLGTIHPDEE 60 

IEESKQETSM ILDSEKTSET AAKGVNTGGR EPNTMVEKER PLADKKAQRP PERSDFSDSI 120 

KIQTPELGEV FQNKDSDYLK NDHPEEHLKT SGLAGEPEGE LSKEDHGNTE KYMGTESQGS 180 

AAAEPEDDSP HWTPHTSVEP GHSDKREDLL IISSFFKEQQ SLQRFQKYFN VHELEALLQE 240 

MSSKLKSAQQ ESLPYNMEKV LDKVFRASES Q1LSIAEKHL ETRVAENRDL GMNENNIFEE 300 

AAVLDDIQDL IYFVRYKHST AEETATLVKA PPLEEGLGGA MEEMQPLHED NFSREKTAEL 360 

NVQVPEEPTH LDQRVIGDTH ASEVSQKPNT EKDLDPGFVT TEDTFMDAID ANKQPETAAE 420 

EPASVTPLEN AILLIYSFMF YLTKSLVATL PDDVOPGPDF YGLPWKPVFI TAFLGIASFA 480 

IFLWRTVLW KDRVYQVTEQ QISEKLKTIM KENTELVQKL SNYEQKIKES KKKVQETRKQ 540 

NMILSDEAIK YKDKIKTLEK NQEILDDTAK NLRVMLESER EQNVKNQDLI SENKKSIEKL 600 

KDVISMNASE FSEVQIALKE AKLSEEKVKS ECHRVQEENA RLKKKKEQLQ QEIEDWSKLH 660 

AELSEQIKSF EKSQKDLEVA LTHKDDNINA LTNCITQLNL LECESESEGQ NXGGNDSDEL 720 

ANGEVGGDRN EKMKNQIKQM MDVSRTQTAI SWEEDLKLL QLKLRASVST KCNLEDQVKK 780 

LEDDRNSLQA AKAGLEDECK TLRQKVEILN ELYQQKEMAL QKKLSQEEYE RQEREHRLSA 840 

ADEKAVSAAE EVKTYKRRIE EMEDELQKTE RSFKNQIATH EKKAHENWLK ARAAERAIAE 900 

EKREAANLRH KLLELTQKMA MLQEEPVIVK PMPGKPWTQN PPRRGPLSQN GSFGPSPVSG 960 

GECSPPLTVE PPVRPLSATL NRRDMPRSEF GSVDGPLPHP RWSAEASGKP SPSDPGSGTA 1020 

TMHNSSSRGS SPTRVLDEGK VNMAPKGPPP FPGVPLMSTP MGGPVPPPIR YGPPPQLCGP 1080 



SEQ ID N0:S1 PAB9 DMA SEQUENCE 

Nucleic Acid Accession!: NM.006457 

Coding sequence: 84-1874 (undertined sequences conespond to start and stop codons) 



AGACTGAGGC GGAGGCAGCC CCGCGCCGCG CCGGACCCGA GCATATT 
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20 

25 



TTGGACTTTG 
CTTGGGGTTT 
TAAAAGATGG 
TTGA.TGGAAT 
GTACAGGCTC 
TTCCTGTTCA 
CTGTGTCCAA 
GTTCIGTGTC 
CCCATGCGAC 
TGTTCGCTGC 
CACTGAGCGC 
GTTCCGAGAC 
AACAGCAAAA 
TACCCACTCA 



AGCCATTAGA 
CCGGCTGCAG 
CGGCAAGGCA 



TTTGAATATG 



AGTCACTTCC 
TTCACCAAAA 
CACCTCATCA 



TTCTCAGGAG 
TGGCCCACCA 
CAGTGATGCC 
AACAACTCAG 



ACCATGAGCA 
GGCGGTAAGG 
GCCCAGGCAA 
GGAATGACTC 
ACTCTGCAAA 
CCTAAAGAAG 
ACAAACAACA 
GTCACATCCA 
CATGCTTCCC 
CATGCTAATG 
GCAGTTAATG 
CTAGCAGAGG 
AGAAAACACA 



ACTACAGTGT 
ATTTCAACAT 
ATGTAAGAAT 
ATCTTGAAGC 
GAGCATCTGC 
TAGTTAAACC 
TGGCCTACAA 
TCCCATCACC 
CTTCACCCGT 
CCAATCTTAG 
TCCCACGGCA 
GACAGAGAAG 
TTGTGGAGCG 
GACTGATTGA 



GTCACTGGTT 



AGGCGATGTG 



TGCACCCAAG 
TGTGCCCATT 
TAAGGCACCA 
ATCGTCTGCC 
GGCTGCCGTC 
TCCTGACCAG 
GCCCACAGTC 



ATCTCTAGTC 

ATTAAGGGTT 
CCTGAGCCGG 
ACATCTCCTG 



TTCACCCCAG 
ACTCCTCCCC 
TCTCCATCTG 
ACCAGCGTGT 



GACTG 

AAATAACTCT CAGGAGCCTT 
GCCCGAGAGC C 



CAACCTCTGG CAGACCAGGG GTTACCAGCC TCACAACTGC AGCTGCCTTC AAGCCTGTAG 1140 
GATCCACTGG CGTCATCAAG TCACCAAGCT GGCAACGGCC AAACCAAGGA GTACCTTCCA 



A CTTTAGTGCA AAGAGCTGAG CACATTCCAG 



1260 
1320 
1380 



AATTCTTTGC CCCTGAATGT GGTCGATGCC AAAGGAAGAT CCTTGGAGAA GTCATCAATG 



35 

40 



AGTGGCCCTG 



TACTAATTAA 



AGAGACGGTT TGGCATTTAT TATTACTTTT TCCTGTATTT TATGCCCATA 



AGTGAAGAAT TTAATTTTAG 



AACAGAATTA TTGTATTTAA 
TTAAACAGAG AATTTTATCA 
GCGCGGTGGC TCACGCCTGT 
GAGGTCAGGA GTTTGAGATC 
ACAAAAATTA GCCGGACGCA 
GCACGAGAAT CACTTGAACC 
CACTCCAGCC TGGGTGACAG 
TATTTTTGCC TTACAGTGGA 



AATAAATAAT 
ATCTGCAAAA 
AAAAAAACTA 
GTAATAGGTG 
AATCCCAGCA 
AGCCTGGCCA 
GTGGCACGCG 
CGGGAGGGAG 



CCAATGTGAA 
GGCAATGAAA 
ATACTTATCT 



ATAATTATAC 
TTAAAATAGT 



CTTCTTTCCT 



CTTTGGGAGG 
ACATGGTGAA 
CCTGTAATCC 
AGGTTGCAGT 



CCAAGGTGGG 
ACCCCATCTC 
CAGCTACTCA 
GAGCCAAGAT 



AAATAGGATT 
TGTAGGCTGA 
TGGACCACAT 



AATAAGATTT 



AATAGAGGGC CAGGTGTGGT G 



GGGCGGATCA TGAGGTCAAC 



;A AACCCTGTCT 



2280 
2340 
2400 
2460 
2520 



CGTACCACTG 
GCTTGTATAT 
TTTATCAAAA 



3000 
3060 
3120 
3180 
3240 



60 
65 



MSNYSVSLVG PAPWGFRLQG 
MTHLEAQNK1 KGCTGSLNMT 
NNMAYNKAPR PFGSVSSPKV 
ANANIjSADQS psalsagkta 



I 



I 



I 



DNTKKANNSQ EPSPQLASLV 
PSWQRPHGOT PSTGRISHSA 
AHCNQVIRGP FLVALGKSWH 
RCQRKILGEV INALKQTWHV 
CEFPIEAGDM FLEALGYTWH 



GKDFNMPLTI SSLKDGGKAA QANVRIGDW LSIDGINAQG 
LQRASAAPKP EFVPVQKGEP KEWKPVPIT SPAVSKVTST 
TSIPSPSSAF TPAHATTSSH ASPSPVAAVT PPLFAASGLH 
SVCSETSQEL AEGQRRGSQG DSKQQNGPPR 
WRPRTGTTQS RSFRILAQIT GTEHLKESEA 
ASTRSMPESL DSPTSGRPGV TSLTTAAAPK PVGSTGVIKS 
TYSGSVAPAK SALGQTQPSD QDTLVQRAEH IFAGKRTPMC 
PEEFNCAHCK NTMAYIGFVE EKGALYCELC YEKPFAPECG 
SCPVCVACGK PIRNNVFHLE DGEPYCETDY YALFGTICHG 
DTCFVCSVCC ESLEGQTFFS KKDKPLCKKH AHSVNF 



SEQ ID N0:53 PBH7 DNA SEQUENCE 

Nucleic Acid Accession*: AA431407 

1-864 (underlined sequences correspond to start and slop codons) 



ATGGCCAACT GTAAAATGAC CAAAAGCATC AGGTTOCCTG CCCTGGAGCA CTGCTATACT 
GGCGGGGAGG TCGTGTTGCC CAAGGATCAG GAGGAGTGGA AAAGACGGAC GGGCCTTCTG 
CTCTACGAGA ACTATGGGCA GTCGGAAACG GGACTAATTT GTGCCACCTA CTGGGGAATG 
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15 
20 



30 
35 



AAGATCAAGC CGGGTTTCAT GGGGAAGGCC ACTCCACCCT ATGACGTCCA GTTTCATATG 240 

GAGGCCTCAG TTGAAAACTG CATTATTGTG AGCATGAACA CCGCTGACCC TGGCAGCCAG 300 

GGCATCACAC ACAGCCTCTT GCTACAGGTC ATTGATGACA AGGGCAGCAT CCTGCCACCT 360 

AACACAGAAG GAAACATTGG CATCAGAATC AAACCTGTCA GGCCTGTGAG CCTCTTCATG 420 

TGCTATGAGG GTGACCCAGA GAAGACAGCT AAAGTGGAAT GTGGGGACTT CTACAACACT 480 

GGGGACAGAG GAAAGATGGA TGAAGAGGGC TACATTTGTT TCCTGGGGAG GAGTGATGAC 540 

ATCATTAATG CCTCTGGGTA TCGCATCGGG CCTGCAGAGG TTGAAAGCGC TTTGGTGGAG 600 

GTGAAGGCCT TTATTGTCCT GACCCCACAG TTCCTGTCCC ATGACAAGGA TCAGCTGACC 720 

AAGGAACTGC AGCAGCATGT CAAGTCAGTG ACAGCCCCAT ACAAGTACCC AAGGAAGGTG 780 

GAGTTTGTCT CAGAGCTGCC AAAAACCATC ACTGGCAAGA TTGAACGGAA GGAACTTCGG 840 



CAAATCCCTG GCCACTTTAG TCTCCCCACT ATGGTGAGGA CGAGGGTGGG 
TCTTGATTTG GGAAAGTATC AGGAGTGCCA TGATTCCAAT GTTTTCCTTC 
AATTCAGTTG CTCTGCTTCC TCCAAGTCCT 
TCATAACGCA AGTAATAAAA TACTGATATC AACAA 



KIKPGFMGKA TPPYDVQFHM EASVENCIIV SMNTADPGSQ GITHSLLLQV IDDKGSII.PP 
NTEGNIGIRI KPVRPVSLFM CYEGDPEKTA KVECGDFYNT GDRGKMDEEG YICFLGRSDD 
IINASGYRIG PAEVESALVE HPAVAESAW GSPDPIRGEV VKAFIVLTPQ FLSHDKDQLT 
KELQQHVKSV TAPYKYPRKV EFVSELFKTI TGKIERKELR KKETGQM 

SEQ ID NO:55 PBJ5 DNA SEQUENCE 

Nucleic Acid Accession*: AP388200 

Coding sequence: 33-137 (undeitined sequences correspond to start and slop codons) 



r. XTATCTGGAG TGAAAGTAAA TGAGACCAAC AGAGATAAGA 240 

GACCCGGAGA AATCCTGGTT ACACTGCTTG AATCCTGTCA GTCCTATACT GGAGTCCTGT 300 

TAATACAAAA TAATAGTAAT AATCCCTCTG TTTCTTATGT TTATGCCAAC TTCAACAAAA 360 

AGAAACTTGA CTAAGAGACA ATATAAGAAC TTAATGTGTA ATTAAGAAAG AACTCTCCAC 420 

CACGGGGAAT GTGAAAGGTA TATGAGTCCC TTTTCACGAT GCGATGTCAT GTCTTTTAAA 480 
TAAGCCATAC TTTATGTTCA ATAAAAAGAG AATAAGCAGG A 



MCCEIYYRLL VLKMEKKSEE LRNMDGLGNV EKGH 

SEQ ID N0:57 PBJ7 DNA SEQUENCE 

nd to start and stop codons) 

1 11 21 31 41 51 

1 I I I I I 

ATGGACAGTT GCCTGCAACA TATGAGAGAC CTACTTTACC TCCTTCAGGA GCTCAGGTGT 
TTAAATCCAG CTACACTACT CCCTGATCCA GACTCCACTA CTCCTGTTCA TGACTGTCAG 
GATCTGTTGG AAACTACCAA AACTGGCCAA CCTGATCTTC AAGATGTGCC CCTAGAAAAG 
GCAGATGCCA CTGTGTTCAC AGATGGTAGC AGCTTCCTCG AGCAGGGAGA ACGAAAAGCT 
GTTTCTTTTC CACAGCCAGA TCTGCCTGAC AATCCCACAT ACTCAACAGA AGAAGAAAAA 
CTGGCTTCAG ATGTTGGAGC AAATAAAAAT CAGGAAGGAC GTGTATTCGC AAACACTACT 
TGGAGGGCCG GTACCTCCAA GGAAGTCTCC TTTGCAGTTG ATTTATGTGT ACTGTTCCCA 



65 GAGCCAGCTC GTACCCATGA AGAGCAACAT AATTTGCCGG TCATAGGAGC AGGAAGTGTC 
GACCTTGCAG CAGGATTTGG ACACTCTGGG AGCCAAACTG GATGTGGAAG CTCCAAAGGT 
GCAGAAAAAG GGCTCCAAAA TGTTGACTTT TACCTCICTC CTGGAAATCA CCCTGACGCT 
AGCTGTAGAG ATACTTACCA GTTTTTCTGC CCTGATTGGA CATGTGTAAO TTTAGCCACC 
TACTCTGGGG GATCAACTAG ATCTTCAACT CTTTCCATAA GTCGTGTTCC TCATCCTAAA 

70 TTATGTACTA GAAAAAATTG TAATCCTCTT ACTATAACTG TCCATGACCC TAATGCAGCT 
CAATGGTATT ATGGCATGTC ATGGGGATTA AGACTTTATA TCCCAGGATT TGATGTTGGG 
ACTATGTTCA CCATCCAAAA GAAAATCTTG GTCTCATGGA GCTOCCCCAA GCCAATCGGG 
CCTTTAACTG ATCTAGGTGA CCCTATATTC CAGAAACACC CTGACAAAGT TGATTTAACT 
GTTCCTCTGC CATTCTTAGT TCCTAGACCC CAGCTACAAC AACAACATCT TCAACCCAGC 

75 CTAATGTCTA TACTAGGTGG AGTACACCAT CTCCTTAACC TCACCCAGCC TAAACTAGCC 
CAAGATTGTT GGGTATCTTT AAAAGCAAAA CCCCCTTATT ATGTAGGATT AGGAGTAGAA 
GCCACACTTA AACGTGGCCC TCTATCTTGT CATACACGAC CCCGTGCTCT CACAATAGGA 
GATGTGTCTG GAAATGCTTC CTCTCTGATT AGTACCGGGT ATAACTTATC TGCTTCTCCT 
TTTCAGGCTA CTTGTAATCA GTCCCTGCTT ACTTCCATAA GCACCTCAGT CTCTTACCAA 

80 GCACCCAACA ATACCTGGTT GGCCTCCACC TCAGGTCTCA CTCGCTGCAT TAATGGAACT 
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GAACCAGGAC CTCTCCTGTG CGTGTTAGTT CATGTACTTC CCCAGGTATA TGTGTACAGT 1440 

GGACCAGAAG GACGACAACT CATCGCTCCC CCTGAGTTAC ATCCCAGGTT GCACCAAGCT 1500 

OTCCCACTTC TGGTTCCCCT ATTGGCTGGT CTTAGCATAG CTGGATCAGC AGCCATTGGT 1560 

ACGGCTGCCC TGGTTCAAGG AGAAACTGGA CTAATATCCC TGTCTCAACA GGTGGATGCT 1620 

GATTTTAGTA ACCTCCAGTC TGCCATAGAT ATACTACATT CCCAGGTAGA GTCTCTGGCT 1680 

GAAGTAGTTC TTCAAAACTG CCGATGCTTA GATCTGCTAT TCCTCTCTCA AGGAGGTTTA 1740 

TGTGCAGCTC TAGGAGAAAG TTGTTGCTTC TATGCCAATC AATCTGGAGT CATAAAAGGT 1800 

ACAGTAAAAA AAGTTCGAGA AAATC TAG AT AGGCACCAAC AAGAACGAGA AAATAACATC 1860 

CCCTGGTATC AAAGCATGTT TAACTGGAAC CCATGGCTAA CTACTTTAAT CACTGGGTTA 1920 

GCTGGACCTC TCCTCATCCT ACTATTAAGT TTAATTTTTG GGCCTTGTAT ATTAAATTCG 1980 

TTTCTTAATT TTATAAAACA ACGCATAGCT TCTGTCAAAC TTACGTATCT TAAGACTCAA 2040 
TATGACACCC T 



MDSCLQHMED LLYLLQELRC LNPATLLPDP DSTTPVHDCQ DLLETTKTGQ PDLQDVPLEK 60 

ADATVFTDGS SFLEQGERKA VSFPQPDLPD NPTYSTEEEK LASDVGANKN QEGRVFANTT 120 

WRAGTSKEVS FAVDLCVLFP EPARTHEEQH NLPVIGAGSV DLAAGFGHSG SQTGCGSSKG 180 

AEKGLQNVDF YLCPGNHPDA SCRDTYQFFC PDWTCVTLAT YSGGSTRSST LSISRVPHPK 240 

LCTRKMCHPL TITVHDPNAA QWYYGMSW3L RLYIPGFDVG TMFTIQKKIL VSWSSPKPIG 300 

PI/TDLGDPIF QKHPDKTOLT VPLPFLVPRP QLQQQHLQPS LMSILGGVHH LLNLTQPKLA 360 

QDCWLCLKAK PPYYVGLGVE ATLKRGPLSC HTRPRALTIG DVSGNASCLI STGYNLSASP 420 

FQATCNQSLL TSISTSVSYQ APNNTWLACT SGLTRCINGT EPGPLLCVLV HVX.PQVYVYS 480 

GPEGRQLIAP PELHPRLHQA VPLLVPLLAG LSIAGSAAIG TAALVQGETG LISLSQQVDA 540 

DFSNLQSAID 1LHSQVESLA EWLQNCRCL DLLFLSQGGL CAALGESCCF YANQSGVIKG 600 

TVKKVRENLD RHQQERENNI PWYQSMFNWN PWLTTLITGL AGPLLILIXS LTFGPCILNS 660 
FLNFIKQRIA SVKLTYLKTQ YDTLVNW 

SEQIDN0:59 PCQ1 DNA SEQUENCE 

Nucleic Acid Accession*: NM_019005 

Coding sequence: 1 82-1 885 (underlined sequences correspond to start and stop codons) 



TGATGGTGGA AATTTCTTGA AACCGCTCTC GTAATTTGCC ACGTGCTGTT G 
TGGTGAATGA ACACAGAATC AGCATGGCTT TCCTTTGCTG AGAAATCACT GATGGGAAGT 
GAGACTTGTT AAACTTGAAA GTGAATGGAC CTGAGTGGAC CCTTTGATCA CATCAGTAAA 
CATGAGCGGT ACCAAACCTG ATATTTTATG GGCACCACAC CATGTTGATA GATTTGTTGT 
GTGTGACTCA GAACTAAGTC TTTATCATGT GGAATCTACT GTGAATTCAG A 



ACCCTATATG AAATGTGTTG CCTGGTATCT TAATTATGAT CCTGAATGTC TGCTGGCAGT 420 

TGGACAAGCA AATGGTGGAG TTGTACTTAC AAGCCTTGGT CAAGATCATA ACTCAAAGTT 480 

CAAAGATTTG ATAGGAAAAG AGTTTGTTCC AAAACATGCA CGACAATGTA ATACCCTTGC 540 

CTGGAATCCA CTGGATAGTA ACTGGGTAGC TGCTGGTTTA GATAAGCACA GAGCTGACTT 600 

TTCAGTGCTA ATATGGGATA TCTGCAGCAA ATATACTCCT GATATAGTTC CCATGGAAAA 660 

AGTGAAACTT TCAGCAGGTG AAACTGAAAC AACATTATTA GTAACAAAAC CACTTTATGA 720 

GTTAGGACAG AATGATGCTT GTCTGTCTCT TTGTTGGCTT CCACGAGACC AGAAACTTCT 780 

CCTTGCTGGT ATGCATCGTA ACCTAGCTAT ATTTGATCTT CGGAATACAA GCCAAAAGAT 840 

GTTCGTA&AT ACAAAAGCTG TTCAGGGTGT GACGGTAGAC CCATATTTCC ACGATCGTGT 900 

TGCTTCCTTC TATGAAGGTC AGGTTGCAAT ATGGGATCTT AGAAAATTTG AGAAGCCAGT 960 

TTTGACATTG ACTGAGCAAC CAAAACCCTT AACAAAAGTA GCATGGTGTC CCACTAGGAC 1020 

TGGTCTACTT GCCACTTTAA CAAGGGATAG TAATATTATT AGATTGTATG ATATGCAGCA 1080 

TACACCCACT CCCATTGGGG ATGAAACTGA ACCCACAATA ATTGAAAGAA GTGTGCAACC 1140 

TTGTGACAAT TACATTGCTT CCTTTGCGTG GCATCCAACA AGTCAAAATC GAATGATAGT 1200 

TGTAACTCCC AACCGAACAA TGTCAGACTT CACTGTTTTT GAAAGGATAT CTCTTGCCTG 1260 

T ACATCTTTAA TGTGGGCTTG TGGTOGTCAT TTATATGAAT GTACGGAAGA 1320 



Oft 1440 

TCCACAGCTC AAGTCACTCT GGTATACTCT GCACTTTATG AAGCAATACA CAGAAGATAT 1500 

GGATCAGAAA TCTCCAGGCA ACAAAGGATC ATTGGTTTAT GCAGGAATTA AATCAATTGT 1560 

AAAGTCATCG TTGGGAATGG TGGAAAGCAG CAGACATAAT TGGAGTGGGT TGGATAAGCA 1620 

AAGTGATATT CAAAACTTAA ATGAAGAGAG AATCTTAGCT TTACAGCTTT GTGGGTGGAT 1680 

AAAGAAAGGA ACGGATGTAG ACGTGGGGCC ATTTTTGAAC TCCCTTGTAC AAGAAGGGGA 1740 

ATGGGAAAGA GCTGCTGCTG TGGCATTGTT CAACTTGGAT ATTCGCCGAG CAATCCAAAT 1800 

CCTGAATGAA GGGGCATCTT CTGAAAAAGG CAGGAGATCT GAATCTCAAT GTGGTAGCAA 1860 

TGGCTTTATC GGGTTATACG GATGAGAAGA ACTCCCTTTG GAGAGAAATG TGTAGCACAC 1920 

TGCGATTACA GCTAAATAAC CCGTATTTGT GTGTCATGTT TGCATTTCTG ACAAGTGAAA 1980 

CAGGATCTTA CGATGGAGTT TTGTATGAAA ACAAAGTTGC AGTACGTGAC AGAGTGGCAT 2040 

TTGCTTGTAA ATTCCTTAGT GATACTCAGA TACATCGAAA AGTTGACCAA TGAAATGAAA 2100 

GAGGCTGGAA ATTTGGAAGG AATTTTGCTT ACAGGCCTTA CTAAAGATGG AGTGGACTTA 2160 

ATGGAGAGTT ATGTTGATAG AACTGGAGAT GTTCAAACAG CAAGTTACTG TATGTTACAG 2220 

GGTTCACCTT TAGATGTTCT TAAAGATGAA AGGGTTCAGT ACTGGATTGA GAATTATAGA 2280 

AATTTATTAG ATGCCTGGAG GTTTTGGCAT AAACGAGCTG AATTTGATAT TCACAGGAGT 2340 

AAGTTGGATC CCAGTTCCAA GCCTTTAGCA CAAGTTTTTG TGAGTTGCAA TTTCTGTGGC 2400 

AAGTCAATCT CCTACAGCTG TTCAGCTGTG CCTCATCAGG GCAGAGGTTT TAGTCAGTAT 2460 

GGTGTGAGTG GCTCACCAAC GAAATCTAAA GTCACAAGTT GTCCTGGCTG TCGAAAACCA 2520 

CTTCCTCGAT GTGCGCTOTG TCTCATTAAT ATGGGAACAC CAGTTTCTAG CTGTCCTGGA 2580 



324 



WO 02/30268 



PCT/US01/32045 



GGAACCAAAT CAGATGAAAA AGTGGACTTG AGCAAGGACA AAAAATTAGC CCAATTTAAC 2610 

AACTGGTTTA CATGGTGTCA TAATTGCAGG CACGGTGGAC ATGCTGGACA TATGCTTAGT 27 DO 

TGGTTCAGGG ACCATGCAGA GTGCCCTGTG TCTGCATGCA CGTGTAAATG TATGCAGTTG 2760 

GATACAACGG GGAATCTGGT ACCTGCAGAG ACTGTCCAGC CATAAAATGT TACCACCTTA 2820 

AGAGAACCCT TCAAGTGTGG AGCTTTCTAG TAGGTGTCCT TCATAGCTCA GAAACATACC 2880 

TCAGAACAAG GCATTCATGA CTTACCTGTA ATGGGAAAAT AAATCATTCT ATCAGAAAAA 2940 
AAAAAAAAAA A 



1 11 21 31 41 51 

I I I I I I 

MSGTKPDI LW APHHVDRFW CDSELSLYHV ESTVNSELKA GSLRLSEDSA ATLLSINSDT 
PYMKCVAWYL NYDPECLLAV GQANGRWLT SLGQDHNSKF KDLIGKEFVP KHARQCNTLA 
WNPLDSNWLA AGLDKHRADF SVLIWDICSK YTPDIVPMEK VKLSAGETET TLLVTKPLYE 
LGQNTACLSL CWLPRDQKLL LAGMHRNLAI FDLRNTSQKM FVNTKAVQGV TVDPYFHDRV ■ 
ASFYEGQVAI WDLRKFEKPV LTLTEQPKPL TKVAWCPTRT GLLATLTRDS NIIRLYDMQH 
TPTPIGDETE PTIIERSVQP CDNYIASFAW HPTSQNRMIV VTPNRTMSDF TVFERIELAW 
SPITSLMWAC GRHLYECTEE ENDNSLEKDI ATKMRLRALS RYGLDTEQVW RNHILAGNED 
PQLKSLWYTL HFMKQYTEDM DQKSPGNXGS LVYAGIKSIV KSSLGMVESS RHNWSGUJKQ 
SDIQNLNEER ILALQLCGWI KKGTDVBVGP FLNSLVQEGE WERAAAVALF NLDIRRAIQI 



25 SEQ ID NO:61 PDG3 DNA SEQUENCE 

Nucleic Acid Accession* U42359 

Coding sequence: 563-775 (undallned sequences correspond lo start and stop codons) 

1 11 21 31 41 51 

30 | | | | | | 

TTGTACATCT TAACAACCTT AAGCTGTACA AATAGAfl " " — 

GATCAGCCCA CAGTACACAT CATTGATGAG AATTTCACTG G 
GAGTCCTGGC TTTGTAAAAT GACTTATAAA GGTCCAJ 
_ AAGCTGGCAT TCTGTAAAGG CACCATCGTC TATCCCCTGT CTTATCTAGA TAAAGAATGT 

35 AGTGCTAAAT CTTGTAATAA TATTGTACAA ATGGAAATTC AATCTTAAGG A 

CCATATTGTT GTATTTCATT GTGGTGTATT GGAAAGTGAT CTGGACTTTG A 
TGTGATTTGG ACCATGGCAC TTAAAAACTC TATAACCTCA GGCAAGTCTT TTAATCTTCT 
CTGAGCCTCA GTTTTCCTCA TTTTTCAAAT ATAGAGAGTA T. 
AAGTTGTAGT AAATTACTGT TTTACAAATG TAAGATZ 

T ACATTAT TAT G TTTATCTGC CACAGGGAGA AGTCCTCAGA TAAAAATGTC 
A CTC3ACACGTG GAGTTAATCA TTTGACAGAT GCAAATGCTT C 
CAAATATACT TTCTTTAACT TCTGTGTGGG TATCAC1 

AAATATTTTT TAATTCTATC TTAGGAAAAA TTGTAGNCAA ATCTTTTTNT CCCATTAACA 
AATAATGTAA GCCTTAATAT TCAAGGGGTA ATAAAAATAC AAAGTCTTCC A 
CTTAC TTGAA AACTTT 



1 11 21 31 41 51 

I 1 I I I I 

MGARGAPSRR RQAGRRLRYL PTGSFPFLLL LLLLCIQLGG GQKKKENLLA EKVEQLMEWS 

SRRSIFRMNG DKFRKFIKAP FRNYSMIVMF TALQPQRQCS VCRQANEEYQ ILANSWRYSS 

AFCNKLFFSM VDYDEGTDVF QQLNMNSAPT FXHXPPKGRP KRADTFDLQR IGFAAEQLAK 

W1ADRTDVHI RVFRP PNYSG TIALALLVSL VGGLLYXRRH HLEFIYNK'fG WAMVSLCIVF 

AMTSGQMWNH IRGPPYAHKN PHNGQVSYIH GSSQAQFVAE SHIILVLNAA ITMGMVLLHE 

K RRIICLVGLG LWFFFSFLL SIFRSKYHGY PYSDLDFE 

SEQ ID NO:63 PDG8 DNA SEQUENCE 



60 

„ I 

CD GGTCGCCGCA CCGGCCGCCT CCGGCCCGCC GCCGCCCCCA G ~~ 

GGGGCGCCCA CCGCGCTGCC AGCCTACCCC GCGGCCGAGC C 
CAGGGCGAGC CGCTGCATTT CTGCTGCCTA GACTTCAGCC 1 

CCGGGCTGGC GGCTGAACCG TAAGCCCATT GAGTCCACGC T 

CTGGTCATCG TGGTGTGGAG CGTGGCCGCC CTCATCTGGC CGGTGCCCAT CATCGCCGGC 

70 TTCCTGCCCA ACGGCATGGA ACAGCGCCGG ACCACCGCCA GCACCACCGC AGCCACCCCC 

GCCGCAGTGC CCGCAGGGAC CACCGCAGCC GCCGCCGCCG CCGCCGCTGC CGCCGCCGCC 
GCGGCCGTCA CTTOGGGGGT GGCGACCAAG TGACCCGCTC CGCTCCTCCC TGTGTCCGTC 
CTGTGTCCGC GCGCGCGGGT GCCTTTCCCG CCGGGGACTC GGCCGGTGTG CTTCGTGCTG 
TAGTTATCGT TAGTTCCTCT TCCCGAGATG GGGCCGCCGA GAGACCCCAG CGCCTTTGAA 

75 AAGCAAGGTT TGTGCTGCGC TTCCAGTTCC GAAAAGCAGA TGTTTAAGCC CTTGGACTGA 

GGGTGGGATC GCAGCTCCGA AGACGGAGAG GAGGGAAATG GGGCCCTTTC CCCTCTATTG 
CATCCCCCTG CCCGACTCCT TCCCCGCACC CACGTGCCCT AGATTCATGG CAGAAAATGA 
CCAAATCCTG TGTATTTGTT TTATATATTT AATAACTGTT TTAAATGAAA GTTTTAGTAA 

c _ AAAAAATACA AAACAAAAAG ATTAAATTGC TATTGCTGTA GTAAGAGAAG CTCTTTGTAT 

80 CTGAACATAG TTGTATTTGA AATTTGTGGT TTTTTAATTT ATTTAAAATT GGGGGGAGGG 
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5 

10 



55 
60 
65 



CATGGGAAGG ATTTAACACC GATATATTGT TACCGCTGAA AATGAACTTT ATGAACCTTT 1020 
TCCAAGTTGA TCTATCCAGT GACGTGGCCT GGTGGGCGTT TCTTCTTGTA CTTATGTGGT 1080 
TTTTTGGCTT TTAATACAGA CATTTTCCTC CAAAAAAAAA AAAAAAAAGG 



I I I I I I 

GRRTGRLRPA AAPSAAAATA GAPTALPAYP AAEPPGPLWL QGEPLHFCCL DFSLEELQGE 
PGWRLNRKPI ESTLVACFMT LVIWWSVAA LIWPVPIIAG FLPNGMEQRR TTASTTAATF 
AAVPAGTTAA AAAAAAAAAA AAVTSGVATK 



15 Nucleic Acid Accession* 



SEQIDNO:65 PDM1 DNA SEQUENCE 

NM_006765 



Coding sequence: 149-1 1 9a (underlined sequences correspond to start and slop codons) 



GCTGCTGCTC TGCATCCAGC 



TGGAGGTTTG CTTTATTNGA GAAGGAACAA CTTGGAGTTC ATCTATAACA AGACTGGTTG 840 



CCGTGGACCT CCATATGCTC ATAAGAACCC ACACAATGGA CAAGTGAGCT ACATTCATGG 960 

CACCATGGGG ATGGTTCTTC TAAATGAAGC AGCAACTTCG AAAGGCGATG TTGGAAAAAG 1080 

ACGGATAATT TGCCTAGTGG GATTGGGCCT GGTGGTCTTC TTCTTCAGTT TTCTACTTTC 1140 

40 AATATTTCGT TCCAAGTACC ACGGCTATCC TTATAGTGAT CTGGACTTTG AGTGAGAAGA 1200 

TGTGATTTGG ACCATGGCAC TTAAAAACTC TATAACCTCA GCTTTTTAAT TAAATGAAGC 1260 

CAAGTGGGAT TTGCATAAAG TGAATGTTTA CCATGAAGAT AAACTGTTCC TGACTTTATA 1320 

CTATTTTGAA TTCATTCATT TCATTGTGAT CAGCTAGCTT ATTCTTGTGT ACTTTTTTTA 1380 

. _ AACTGTGGGT TTTCCTAGTA AATTTAATTT ACAGAAATCA ATGGTAGCAT TTAGTAATCT 1440 

45 ACAAAGGAAA TATCAAAGTG TTTTTCAAGC CTGTTATATY CAGTGTGTKC CACAGGATTG 1500 

50 



SEQ ID N0:67 PDM2 DNA SEQUENCE 

NM_000947 

Coding sequence: 88-1617 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGTTTCATAT GAACTCTCCC GCCACCCGGG AACAGCTGGC TGCCACCGTT TGTGTTTTCC 
GAGTTTGTAT TCTTGCAGGT GACCAAGATG GAGTTTTCTG GAAGAAAGCG GAGGAAGCTG 
AGGTIGGCAG GTGACCAGAG GAATGCTTCC TACCCTCATT GCCTTCAGTT TTACTTGCAG 
CCACCTTOTG AAAACATATC TTTAACAGAA TTTGAAAACT TGGCTATTGA TAGAGTTAAA 
TTGTTAAAAT CAGTTGAAAA TCTTGGAGTG AGCTATGTGA AAGGAACTGA ACAATACCAG 
AGTAAGTTGG AGAGTGAGCT TCGGAAGCTC AAGTTTTCCT ACAGAGAGAA GCTAGAAGAT 
GAATATGAAC CACGAAGAAG AGATCATATT TCTCATTTTA TTTTGCGGCT TGCTTATTGC 
CAGTCTGAAG AACTTAGACG CTGGTTCATT CAACAAGAAA TGGATCTCCT TCGATTTAGA 
TTTAGTATTT TACCCAAGGA TAAAATTCAG GATTTCTTAA AGGATAGCCA ATTGCAGTTT 
GAGGCTATAA GTGATGAAGA GAAGACTCTT CGAGAACAGG AGATTGTTGC CTCATCACCA 
AGTTTAAGTG GACTTAAGTT GGGGTTCGAG TCCATTTATA AGATCCCTTT TGCTGATGCT 
CTGGATTTGT TTCGAGGAAG GAAAGTCTAT TTGGAAGATG GCTTTGCTTA CGTACCACTT 



WO 02/30268 



PCT/USO 1/32045 



AAQGACATTG TGGCAATCAT CCTGAATGAA TTTAGAGCCA AACTGTCCAA GGCTTTGGCA 
TTAACAGCCA GGTCGTTGCC TGCTGTGCAG TCTGATGAAA GACTTCAGCC TCTGCTCAAT 
CACCTCAGTC ATTCCTACAC TGGCCAAGAT TACAGTACCC AGGGAAATGT TGGGAAGATT 
TCTTTAGATC AGATTGATTT GCTTTCTACC AAATCCTTCC CACCTTGCAT 
CATAAAGCCT TGCGGGAAAA TCACCATCTT CGTCATCGAG GCCGAATGCA 
TTTCTGAAGG GCATTGGTTT AACTTTGGAA CAGGCATTGC AGTTCTGGAA GCAAGAATTT 
ATCAAAGGAA AGATGGATCC AGACAAGTTT GATAAAGGTT ACTCTTACAA CATCCGTCAC 
AGCTTTGGAA AGGAAGGCAA GAGGACAGAC TATACACCTT TCAGTTGCCT GAAGATTATT 
CTGTCCAATC CACCAAGCCA AGGGGATTAT CATGGGTGCC CATTCCGTCA CAGTGATCCA 
GAGCTGCTGA AGCAAAAGTT GCAGTCATAC AAGATCTCTC CTGGAGGGAT AAGCCAGATT 
TTGGATTTAG TAAAGGGGAC ACATTACCAG GTAGCCTGTC AAAAATACTT TGAGATGATA 
CACAATGTGG ATGATTGTGG CTTTTCTTTG AATCATCCTA ATCAGTTCTT TTGTGAGAGC 
CAACGTATTC TAAATGGTGG TAAAGACATA AAGAAGGAAC CTATCCAACC AGAAACTCCT 
CAACCCAAAC CAAGTGTCCA GAAAACCAAG GATGCATCAT CTGCTCTGGC CTCTTTAAAT 
TCCTCTCTGG AAATGGATAT GGAAGGACTA GAAGATTACT TTAGTGAAGA TTCTTAGGCA 
GTTTTATAAC CCTTTTTCCT CAATAGCCTG TTTCCTGTTT TTAAGATTTT GCCTTTGTTG 
TTGAAAAAGG GTTTCACTGT CACCAAGGCT TAGTGCAGTG ACACAATTAC AGCTGATTGC 
AGCCTTGACC TTCCCAGCTC AAGTGATCCT CCTACCTCAG CCTCCCAAGT AGTTAGGACA 
CACAGGTGTG CACCTCATAT CCAGATAATT TTTTTCAATT TTTTTTTGTA GAGGTGGGGG 



AGCGTCCCAG AGTGCTGGGA TTACAGTTGT GAGCCACTGT GCCTGGCCTT TTTTTTTTTT 

TAACCTTTTC GTTTAACTTC TCTCTTCACT GCATCCCAAT CCATCTACAG GCATGCACAC 

TTATTAGGAA AGGAGGTTTG AGGTAACAAC AGAGACTTTC ACTATATTTT GCTTTGACAG 

AAGGAAAGAG GAGGAGTTTC TATTAAAATC TGTCACTTGA GTGATGTCAT TTAAGTCCTA 2160 

TTTTAGGAGA TAAAAACAGC TTTGGGGACT GGTTAAAGTC CCCCAGAAAC TACAATAAAG 2220 

AACAACTTTT GTTTTAACTC TTAATCACTT TGTAATTTTG ACTCAATCCT TTTCTGGACC 2280 
ATTTTTGTTA ATAAATATCA AAGTGT 



1 11 21 31 41 51 

I I I I I I 

MEFSGRKRBK LRLAGDQRNA SYPHCLQFYL QPPSENISLT EFENLAIDRV KLLKSVENLG 

VSYVKGTEQY QSKLESELRK LKFSYREKLE DEYEPRRRDH ISHFILRLAY CQSEELRRWF 

IQQEMDLLRF RFSILPKDKI ODFLKDSQLQ FEAISDEEKT LREQEIVASS PSLSGLKLGF 

ESIYKIPFAD ALDLFRGRKV YLEDGFAYVP LKDIVAIILN EFRAKLSKAL ALTARSLPAV 

QSDERLQPLL NHLSHSYTGQ DYSTQGNVGK ISLDQIDLLS TKSFPPCMRQ LHKALRENHH 

LRHGGRMQYG LFLKGIGLTL EQALQFWKQE FIKGKMDPDK FDKGYSYNIR HSFGKEGKRT 

DYTPFSCLKI ILSNPPSQGD YHGCPFRHSD PELLKQKLQS YKISPGGISQ ILDLVKGTHY 

QVACQKYFEM IHNVDDCGFS LNHPNQFFCE SQRILNGGKD IKKEPIQP3T PQPKPSVQKT 
KDASSALASL KSSLEHDMEG LEDYFSEDS 

SEQ ID NO:69 PDM3 DNA SEQUENCE 

Nucleic Acid Accession*: NM_024840 

108-491 (underlined sequences correspond to start and stop codons) 



120 

180 



AGAGAAGCCC TATGAATGCA CTGAATGTGA CAAAGCATTC CGCTGGAAAT CACAGCTCAA 
TGCACATCAG AAAGCTCACA CAGGAGAGAA GTCATATATA TGCCGTGATT GTGGAAAAGG 
CTTCATTCAG AAGGGAAATC TCATTGTACA TCAGCGAATT CATACTGGAG AAAAACCCTA 
TATATGCAAT GAATGTGGAA AAGGCTTCAT CCAAAAGGGC AACCTCCTTA TTCATCGACG 
TACTCACACT GGAGAGAAAC CCTATGAATG CAATGAATGT G 
GACATGT TTA A TATCCCATC AGAGATTTCA CACAGGAAAG A 
GTGTGGAAAA TCCTGCTCAC ACAAGTCAGG TCTCATTAAC CACCAGAGAA TTCACACAGG 600 
AGAGAAACCC TATACATGCA GTGACTGTGG GAAAGCTTTC AGAGATAAAT CATGTCTOAA 660 
CAGACATCGG AGAACTCATA CAGGGGAGAG ACCGTATGGA TGCTCTGATT GTGGGAAAGC 720 
TTTCTCCCAC TTGTCATGCC TTGTTTATCA TAAGGGAATG CTGCATGCAA GAGAGAAATG 780 
TGTAGGTTCA GTCAAATTGG AAAATCCTTG CTCAGAGAGT CATAGCTTAT CACATACACG 840 
TGATCTCATA CAGGATAAAG ACTCTGTTAA CATGGTGACT CTGCAGATGC CTTCTGTGGC 900 
AGCTCAGACC TCATTAACTA ACAGTGCGTT CCAAGCAGAG AGCAAAGTAG CCATTGTGAG 960 
CCAGCCTGTT GCCAGAAGTT CAGTCTCAGC AGATAGTAGA ATTTGCACAG AATAAAAACC 1020 
ATATGAATGC AGTGAATGTC GTAGTGCTTT CAGTGATCAA TTACATCATA TGTCACAAAA 1080 
AACACAGAGG AACAAACTGA TATATTCAAG GTGGAAAGCC CTTGAATAAA ACCTTATGGC 1140 
TAATAAGCAT ATACTCAGAG AAAAATAGTA TGAAGTGGAG ACTGGGAAAT TCTTTTATGG 1200 
GAAGATAGAT CTTCTCATCA GTGACCATAG ATCACATCTT CAGTGAGCTT ATAGTTGGTA 1260 
FVECATGGA 

GGTTTACACA O 

CTAGTGGTAC ATTCTGCCTT ATCCTCAGAG GGAATCATAT AGAAATAAAA CTATGAAAAT 
GTAACTAGAA CATCTTCATC AAAATATGAA AGAACACACG AAGCAAATAA GCCCTGTGAA 
AAGGAGTATT TTAGAGATTT CGATCAGAAA TCTAACATCA TTATATGGCA GATAATATAC 
AGGATCTGTA TTTTAGGACA ATATACCTTG AATCACTAGT TGATATGTCA ATGACTAATT 
AAAAGGGGTT GTCAGTGTTA CACATCATTG GTTAAATTTA TAGCACAATG TACCTCTTCC 
CCCTTTTTTG ATAAGAGTCT TCTATTCCCA ACCAAGATCA TTATATGATT AGCTCTTGTG 
TTTCTTTGAT TCCAAATTTC TTCACTTGTT ATTTCAGACT ACTGAAGCTC TTCAAAAGGA 
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SEQ|DNQ;7PPPM3Prpteinse 
ProteinAccession#: NP. 



MDAACVGRPS PKGPGSOTTR 
IVEKASFRRE ISLYISEFIL 
ASARRHV 



, - SEQID N0:71 P0M8 DNA SEQUENCE 

1j Nucleic Acid Accession*: NM_0 18455 

Coding sequence: 341-955 (underlined sequences correspond to start and stop codons) 



AGGTTGCACA CTTCTAAGAA GAGCGGCGTG GGGGGCTCGG CGACCTTCGC TTCAGTCGCT 
CCCCCGTGCA GTCCCCTGTG CCCAAGACAC AGCCTGATGC TTGTGCTCCG GTGGGCGGAC 
TTGGAGGCGG CGGGAACTGC AATTGGTGGC TTTGAAGGGC GGCGAGCGGG AACAGCTCTT 
GAGGAGTGAG ACTGCAGGAG ATGTGGGCCG TGCCAAAGAG ATGGATGAGA CTGTTGCTGA 
GTTCATCAAG AGGACCATCT TGAAAATCCC CATGAATGAA CTGACAACAA TCCTGAAGGC 
CTGGGATTTT TTGTCTGAAA ATCAACTGCA GACTGTAAAT TTCCGACAGA GAAAGGAATC 
TGTAGTTCAG CACTTGATCC ATCTGTGTGA GGAAAAGCGT GCAAGTATCA GTGATGCTGC 
CCTGTTAGAC ATCATTTATA TGCAATTTCA TCAGCACCAG AAAGTTTGGG ATGTTTTTCA 
GATGAGTAAA GGACCAGGTG AAGATGTTGA CCTTTTTGAT ATGAAACAAT TTAAAAATTC 
GTTCAAGAAA ATTCTTCAGA GAGCATTAAA AAATGTGACA GTCAGCTTCA GAGAAACTGA 
GGAGAATGCA GTCTGGATTC GAATTGCCTG GGGAACACAG TACACAAAGC CAAACCAGTA 
CAAAOCTACC TACGTGGTGT ACTACTCCCA GACTCCGTAC GCCTTCACGT CCTCCTCCAT 
GCTGAGGCGC AATACACCGC TTCTGGGTCA GGAGTTAGAA GCTACTGGGA AAATCTACCT 
CCGACAAGAG GAGATCATTT TAGATATTAC CGAAATGAAG AAAGCTTGCA ATTAGTGAAC 
ATGAAAGGAA AATAAAAATT CCTCACAGTC AAAAAAAAAA AAAAA 



MDETVAEFIK RTILKIPMNE LTTILKAWDF LSENQLQTVN FRQRKESWQ HLIHLCEEKR 60 

0 IIYMQFHQHQ KVWDVFQMSK GFGEDVDLFD MKQFKNSFKK ILQRALKNVT 120 

IV VWIRIAWGTQ YTKPNQYKPT YWYYSQTPY AFTSSSHLRR NTPLLGQELE 180 
ATGKIYLRQE EIILDITEMK KACN 

s SEQ ID N0:73 PDM9 DNA SEQUENCE 

Nucleic Acid Accession #: NM.016192 

Coding sequence: 1-1125 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGGTGCTGT GGGAGTCCCC GCGGCAGTGC AGCAGCTGGA CACTTTGCGA GGGCTTTTGC 60 

TGGCTGCTGC TGCIGCCCGT CATGCTACTC ATCGTAGCCC GCCCGGTGAA GCTCGCTGCT 120 

TTCCCTACCT CCTTAAGTGA CTGCCAAACG CCCACCGGCT GGAATTGCTC TGGTTATGAT 180 

GACAGAGAAA ATGATCTCTT CCTCTGTGAC ACCAACACCT GTAAATTTGA TGGGGAATGT 240 

TTAAGAATTG GAGACACTGT GACTTGCGTC TGTCAGTTCA AGTGCAACAA TGACTATGTG 300 

CCTGTGTGTG GCTCCAATGG GGAGAGCTAC CAGAATGAGT GTTACCTGCG ACAGGCTGCA 360 

TGCAAACAGC AGAGTGAGAT ACTTGTGGTG TCAGAAGGAT CATGTGCCAC AGATGCAGGA 420 

TCAGGATCTG GAGATGGAGT CCATGAAGGC TCTGGAGAAA CTAGTCAAAA GGAGACATCC 480 

ACCTGTGATA TTTGCCAGTT TGGTGCAGAA TGTGACGAAG ATGCCGAGGA TGTCTCGTGT 540 

GTGTGTAATA TTGACTGTTC TCAAACCAAC TTCAATCCCC TCTGCGCTTC TGATGGGAAA 600 

TCTTATGATA ATGCATGCCA AATCAAAGAA GCATCGTGTC AGAAACAGGA GAAAATTGAA 660 

GTCATGTCTT TGGGTCGATG TCAAGATAAC ACAACTACAA CTACTAAGTC TGAAGATGGG 720 

CATTATGCAA GAACAGATTA TGCAGAGAAT GCTAACAAAT TAGAAGAAAG TGCCAGAGAA 780 

CACCACATAC CTTGTCCGGA ACATTACAAT GGCTTCTGCA TGCATGGGAA GTGTGAGCAT 840 

TCTATCAATA TGCAGGAGCC ATCTTGCAGG TGTGATGCTG GTTATACTGG ACAACACTGT 900 

GAAAAAAAGG ACTACAGTGT TCTATACGTT GTTCCCGGTC CTGTACGATT TCAGTATGTC 960 

TTAATCGCAG CTGTGATTGG AACAATTCAG ATTGCTGTCA TCTGTGTGGT GGTCCTCTGC 1020 

ATCACAAGGA AATGCCCCAG AAGCAACAGA ATTCACAGAC AGAAGCAAAA TACAGGGCAC 1080 
TACAGTTCAG ACAATACAAC AAGAGCGTCC ACGAGGTTAA TCTGA 
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MVLWESPRQC SSWTLCEGFC W 
DRENDLFLCD TNTCKFDGEC LRIGDTVTCV CQFKCNNDYV PVCGSNGESY QNECYLRQAA 
CKQQSEILW SEGSCATDAG SGSGDGVHEG SGETSQKETS TCDICQFGAE C 
VCNIDCSQTN FNPLCASDGK SYDNACQIKE ASCQKQEKII 



SEQ ID NO:75 PD01 DNA SEQUENCE 
spond to start and stop codons) 



GGCGCCGGGA TTGGGAGGGC TTCTTGCAG 



25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



C TATGGTCCTG GCTGACTTCG GGGCGCGTGT 
C CGCTTGGGCC GGGGCAAGCG 
CTCGCTAGTG CTGGACCTGA AGCAGCCGCG GGAGCCGCGT GCTGCGGCGT CTGTGCAAGC 
GGTCGGATGT GCTGCTGGAG CCCTTCCGCC GCGGTGTCAT GGAGAAACTC CAGCTGGGCC 
CAGAGATTCT GCAGCGGGAA AATCCAAGGC TTATTTATGC CAGGCTGAGT GGATTTGGCC 
AGTTCAGGAA AGCTTCTGCC GGTTAGCTGG CCACGATATC AACTATTTGG CTTTGTCAGG 
TGTTCTCTCA AAAATTGGCA GAAGTGGTGA GAATCCGTAT GCCGCGCTGA ATCTCGTGGC 
TGACTTTGCT GGTGGTGGCC TTATGTGTGC ACTGGGCATT ATAATGGCTC TTTTTGACCG 
CACACGCACT GACAAGGGTC AGGTCATTGA TGCAAATATG GTGGAAGGAA C. 
AAGTTCTTTT CTGTGGAAAA C 



A GCAATAGAAC 



ACGGGGCTCG 
GTTAAACACC 
GGAGATACTT 



GAGGAACAGT 
CTACAGTGAT 
TGGGTACTTA 



TGATATTAAG 



TCTTGAAGAC 
AAATGCCACA 
GGCCTTTTGT 



GAAGAATTTG 
AGTAATAAGG 
CTGCATTTAC 
ATTACAGTGT 
GATTGAATTC 
TACTAAATTA 
TGACT 
ATCGATATAC 
AATTGTATGG 
CTTGGTGTTC 



AGATGAGCAC 
AGACGAAGGC AGAGTGGTGT 
GGAGGTTGTT 



AACACATAAC 
TAATCAAGAA 
TATCATTAGG 
TCTGCCTTCC 



TAAAAATGGT 



CTGAAAAAAA 
GGGACAGTCA 
CTCTGGGCTG 
TTCTGGATCT 
AAAAAAAAAA 



CATATCCAAA 



TATATTTTGA 
ATTTATTTAC 
TGATAAAAGT 
ATGATCTCCC 
AAAGAAAAGT 
ATAATGAGGA 



CACGTGAAAC 
TCTAAGCACA 
TTCACCTGTA 



TCAGCTTTCC 
TATACCCAAC 
AAAAAAAAAA 



11 



I 



TTTCTCCATG 
ACACAGCAAC 



21 



CAGTAACTCG 
TGTTTGATTT 
ATCCAGAAAT 



ATTGTATGCA 
AAGAATTACA 
GCTTTTGATT 
AGTTTGCTTG 
GTGAAAAAGG 
CTACAATGTA 
AGAGTGATTG 
TTCCAAACTT 
TTGAATCAGA 
CTCACTACGT 
GGGCCTGTTT 
CTCCTCAGGC 
AAAGATCTCA 



41 



AGAAGAAGTT 900 



ACTCAGATAA 
GGCTCAAGTG 
TGGAAACATG 
GACTCTGATT 
TATAAAACTT 
ATATATTTGT 



GAAAATGAGG 
GTTGCATCCA 
TAGCAACAGT 
ATGCCTTCAA 



I 



51 

I 



LSGLAPGRXC AMVLADFGAR WRVDRPGSR YDVSRLGRGK RSLVLDLKQP 
REPRAAASVQ AVGCAAGALP PRCHGETPAG PRDSAAGKSK AYLCQAEWIW FVQESFCRLA 
GHDINYLALS GVLSKIGRSG ENPYAPJJJLV ADFAGGGLMC ALGIIMALFD RTRTDKGQVI 
DAHMVEGTAY LSEFLWKTQK SSLWEAPRGQ NMLDGGAPFY TTYRTADGEF MAVGAIEPQF 
YELLIKGLGL KSDELPNQMS TDDWPEMKKK FADVFAKKTK AEWCQIFDGT DACVTPVLTF 
EEWHHDHNK ERGSFITSEE QDVSPRLAPI. LLNTPAIPSS KGDPFIGEHT EEILEEFGFS 
REEIYQLNSD KIIESNKVKA SL 

SEQIDN0:77PDO3DNAi 

Nucleic Acid Accession*: AB028951 

Coding sequence: 97-1128 (underlined sequences correspond to start and stop codons) 



I 



31 



i 



i GATTCTTGAT GGTATCCATT ACCTCCATGC A 
CTTCACAGAG ACTTGAAACC AGCAAATATC CTAGTAATGG GAGAAGGTCC TGAGAGGGGG 
AGAGTCAAAA TAGC TGAC AT GGGTTTTGCC AGATTATTCA ATTCTCCTCT AAAGCCACTA 
GCAGATTTGG ATCCAGTAGT TGTGACATTT TGGTATCGGG CTCCAGAACT TTTGCTTGGT 
GCAAGGCATT ATACAAAGGC CATTGATATA TGGGCAATAG GTTGTATATT TGCTGAATTG 
TTGACTTCGG AACCTATTTT TCACTGTCGT CAGGAAGATA TAAAAACAAG CAATCCCTTT 
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GAAGATATTA GAAAGATGCC AGAATATCCC ACACTTCAAA AAGACTTTAG AAGAACAACG 480 

TATGCCAACA GTAGCCTCAT AAAGTACATG GAGAAACACA AGGTCAAGCC TGACAGCAAA 540 

GTGTTCCTCT TGCTTCAGAA ACTCCTGACC ATGGATCCAA CCAAGAGAAT TACCTCGGAG 600 

5 CAAGCTCTGC AGGATCCCTA TTTTCAGGAG G ACCC TTTGC CAACATTAGA TGTATTTGCC 660 

GGCTGCCAGA TTCCATACCC CAAACGAGAA TTCCTTAATG AAGATGATCC TGAAGAAAAA 720 

GGTGACAAGA ATCAGCAACA GCAGCAGAAC CAGCATCAGC AGCCCACAGC CCCTCCACAG 780 

10 GACTCCAGCC TGAACCAGGT GCCTCCAAAC AAGAAGCCAC GGCTAGGGCC TTCAGGCGCA 960 

AACTCAGGTG GACCTGTGAT GCCCTCGGAT TATCAGCACT CCAGTTCTCG CCTGAATTAC 1020 

CAAAGCAGCG TTCAGGGATC CTCTCAGTCC CAGAGCACAC TTGGCTACTC TTCCTCGTCT 1080 

CAGCAGAGCT CACAGTACCA CCCATCTCAC CAGGCCCACC GGTACTGACC AGCTCCCGTT 1140 

GGGCCAGGCC AGCCCAGCCC AGAGCACAGG CTCCAGCAAT ATGTCTGCAT TGAAAAGAAC 1200 

15 CAAAAAAATG CAAACTATGA TGCCATTTAA AACTCATACA CATGGGAGGA AAACCTTATA 1260 

TACTGAGCAT TGTGCAGGAC TCATAGCTCT TCTTTATTGA CTTAAAGAAG ATTCTTGTGA 1320 

AGTTTCCCCA GCACCCCTTC CCTGCATGTG TTCCATTGTG ACTTCTCTGA TAAAGCGTCT 1380 

GATCTAATCC CAGCACTTCT GTAACCTTCA GCATTTCTTT GAAGGATTTC CTGGTGCACC 1440 

TTTCTCATGC TGTAGCAATC ACTATGGTTT ATCTTTTCAA AGCTCTTTTA ATAGGATTTT 1500 

20 AATGTTTTAG AAACAGGATT CCAGTGGTGT ATAGTTTTAT ACTTCATGAA CTGATTTAGC 1560 

AACACAGGTA AAAATGCACC TTTTAAAGCA CTACGTTTTC ACAGACAATA ACTGTTCTGC 1620 

TCATGGAAGT CTTAAACAGA AACTGTTACT GTCCCAAAGT ACTTTACTAT TACGITCGTA 1680 

TTTATC TAGT TTCAGGGAAG GTCTAATAAA AAGACAAGCG GTGGGACAGA GGGAACCTAC 1740 

AACCAAAAAC TGCCTAGATC TTTGCAGTTA TGTGCTTTAT GCCACGAAGA ACTGAAGTAT 1800 

25 GTGGTAATTT TTATAGAATC ATTCATATGG AACTGAGTTC CCAGCATCAT CTTATTCTGA 1860 

ATAGCATTCA GTAATTAAGA ATTACAATTT TAACCTTCAT GTAGCTAAGT CTACCTTAAA 1920 

AAGGGTTTCA AGAGCTTTGT ACAGTCTCGA TGGCCCACAC CAAAACGCTG AAGAGAGTAA 1980 

CAACTGCACT AGGATTTCTG TAAGGAGTAA TTTTGATCAA AAGACGTGTT ACTTCCCTTT 2040 

GAAGGAAAAG TTTTTAGTGT GTATTGTACA TAAAGTCGGC TTCTCTAAAG AACCATTGGT 2100 

30 TTCTTCACAT CTGGGTCTGC GTGAGTAACT TTCTTGCATA ATCAAGGTTA CTCAAGTAGA 2160 

AGATATAGAG TGACTATTTT TAAAGCATGT TAAAAATTTA GGTTTTAOTC ATGTTTAAAG 2280 

TTTTCATTGC ACTGAATGAT TTCTTTTGCC CCTAGGAGAA AACTTAATAA TTGTGCCTAA 2400 

35 AAACTATGGG CGGATAGTAT AAGACTATAC TAGACAAAGT GAATATTTGC ATTTCCATTA 2460 

TCTATGAATT AGTGGCTGAG TTCTTTCTTA GCTGCTTTAA GGAGCCCCTC ACTCCCCAGA 2520 

GTCAAAAGGA AATGTAAAAA CTTAGAGCTC CCATTGTAAT GTAAGGGGCA AGAAATTTGT 2580 

GTTCTTCTGA ATGCTACTAG CAGCACCAGC CTTGTTTTAA ATGTTTTCTT GAGGTAGAAG 2640 

40 TTATCTACCT GGTTATGATA CTGTGGGTCC AXACACAAGT AAAATAAGAT TAGACAGAAG 2760 

CCAGTATACA TTTTGCACTA TTGATGTGAT ACTGTAGCCA GCCAGGACCT TACTGATCTC 2820 

AGCATAATAA TGCTCACTAA TAATGAAGTC TGCATAGTGA CACTCATCAA GACTGAAGAT 2880 

GAAGCAGGTT ACGTGCTCCA TTGGAAGGAG TTTCTGATAG TCTCCTGCTG TTTTACCCCT 2940 

. _ TCCATTTTTT AAAATAAGAA ATTAGCAGCC CTCTGCATAA TGTAGCTGCC TATATGCAGT 3000 

45 TTTATCCTGTGCCCTAAAGC CTCACTGTCC AGAGCTGTTG GTCATCAGAT GCTTATTGCA 3060 

CCCTCACCAT GTGCCTGGTG CCCTGCTGGG TAGAGAACAC AGAGGACAGG GCATACTTCT 3120 

TGTCCTTAAG GAGCTTGTGA TCTGTGACAG TAAGCCCTCC TGGGATGTCT GTGCCATGTG 3180 

ATTGACTTAC AAGTGAAACT GTCTTATAAT ATGAAGGTCT TTTTGTTTAC TTCTAAACCC 3240 

ACTTG5GTAG TTACTATCCC CAAATCTGTT CTGTAAATAA TATTATGGAA GGGTTTCTAT 3300 

50 GTCAGTCTAC CTTAGAGAAA GCCAGTGATT CAATATCACA AAAGGCATTG ACGTATCTTT 3360 

GAAATGTTCA CAGCAGCCTT TTAACAACAA CTGGGTGGTC CTTGTAGGCA GAACATACTC 3420 

TCCTAAGTGG TTGTAGGAAA TTGCAAGGAA AATAGAAGGT CTGTTCTTGC TCTCAAGGAG 3480 

GTTACCTTTA ATAAAAGAAG ACAAACCCAG ATAGATATGT AAACCAAAAT ACTATGCCCC 3540 

TTAATACTTT ATAAGCAGCA TTGTTAAATA GTTCTTACGC TTATACATTC ACAGAACTAC 3600 

55 CCTGTTTTCC TTGTATATAA TGACTTTTGC TGGCAGAACT GAAATATAAA CTGTAAGGGG 3660 

ATTTCGTCAG TTGCTCCCAG TATACAATAT CCTCCAGGAC ATAGCCAGAA ATCTCCATTC 3720 

CACACATGAC TGAGTTCCTA TCCCTGCACT GGTACTGGCT CTTTTCTCCT CTTTCCTTGC 3780 

CTCAGGGTTC GTGC TACCCA CTGATTCCCT TTACCCTTAG TAATAATTTT GGATCATTTT 3840 

- CTTTCCTTTA AAGGGGAACA AAGCCTTTTT TTTTTTTGAG ACGGAGTGTT GCTCTGTCAC 3900 

60 CCAAGCTGGA GTGCAGTGGC ACGATCTTGG CTCACTCCAA CCTCCACCTT CCAGGTTCAA 3960 

GTGATTCTCC TGCCTCAGCC TCCCGAGTAG CTGGGACTAC GGGCACGCAC CACCACGTCT 4020 
GGCTAATTTT TGTATTTTTA GTAGAGATGG GGTTTCACCC TATTGGTCAG 
AATTCCTCAC CTCAGGTCAT CCGCCTGTCT CGGCCTCCCG AAGTGCTGGG 
TGAGCCACCG CAC CCAGTTG GGAACAAAGC CTTTTTAACA CACGTAAGGG 
05 GTGGGACCTC TAAGGAGACC TTTGAAGCTT TTTGAGGGCA AACTTTACCT 
CAAATGATGG CATTTCTCTT TGAAATTTAT TAGATACTGT TATGTCCCCC 
GAGGGGCATC CCTCAGCCTA TGGGAACACC CAAACTAGGA GGGGTTATTG 
ATGAATCCAA GTGAAGGCTT TCTGCTCTTC GTGTTACAAA CCAGTTTCAG 
CTGGGGAGGT GTGTGTTTGT GAAAGGAATT CAAGTGTTGC AGGACAGATG 
70 AAGGTAGCTT TGGCAGCAGG GCTGATACTA TGAGGCTGAA ACAATCCTTG 
GAICATGCAG TGACATACAA AGACCAAGGA TTATGTATAT TTTTATATCT 
GAAACTTTAG TACTTAGAAT TTTGGCCTTC TGCACTACTC TTTTGCTCTT 
TGGACTCTTA AGAATGGAAA GGGATGACAT TTACCTATGT GTGCTGCCTC 
AAGCAACTGC TACTTGTTCT CTATGCCTCT AAAATGATGC TGTTTTCTCT 
75 AAAGAAAAGA AAAAAATAGT TGGAAAATAA GACATGCAAC TTGATGTGCT 
TTTATGCAGC AGAAACTATA CAATGAAGGA AGAATTCTAT GGAAATTACA 
TCTATGATGA TGTCTTCCTA GGGAGTAGAG AAAGGCAGTG AAATGGCAGT 
GAGGCTTGAA GGATTCAAGT ACAAGTAATA TTTTGTATAA AACATAGCAG 
or> CATAATCCTC AAAAATAGTC ACAAATATAA CAAAGTTCAT TGTTTTAGGG 
OU ACGTGTTGTA CCTAAGGCCA TACTTACTCT TCTATGCTAT CACTGCAAAG 



4080 

ATTATAGGTG 4140 
CCC1CAAACC 4200 

AAGGGTACAG 4320 
ACAGGAAGGA 4380 
AGTTAGCTTT 4440 



TGATGAAGTA 4560 

CTGTGGTTTT 4620 

ACGAACATAA 4680 

ATTCCTGGTG 4740 

GCTAAAGGTA 4800 

TTTGAGTAAA 4860 

AATCCAAAAC 4920 

TAGACCAACA 4980 

TTTAGGTCCC 5040 

TTTTTAAAAA 5100 

GGG1GATATG 5160 
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TATGTATTAT ATAAAAAAAA AAACCCTTAA TGCACTGTTA TCTCCTAAAT ATTTAGTAAA 5220 

TTAATACTAT TTAATTTTTT TAAAGATTTG TCTGTGTAGA CACTAAAAGT ATTACACAAA 5280 

ATCTGGACTG AAGGTGTCCT TTTTAACAAC AATTTAAAGT ACTTTTTATA TATGTTATGT 5340 

AGTATATCCT TTCTAAACTG CCTAGTTTGT ATATTCCTAT AATTCCTATT TGTGAAGTGT 54O0 

ACCTGTTCTT GTCTCTTTTT TCAGTCATTT TCTGCACGCA TCCCCCTTTA TATGGTTATA 5460 

GAGATGACTG TAGCTTTTCG TGCTCCACTG CGAGGTTTGT GCTCAGAGCC GCTGCACCCC 5520 

AGCGAGGCCT GCTCCATGGA GTGCAGGACG AGCTACTGCT TTGGAGCGAG GGTTTCCTGC 5580 



I I I I I I 

VKSLLYQILD GIHYLHANWV LKRDLKPAtU LVMGEGPERG RVKIADMGFA RLFNSPLKPL 
ADLDPVWTF WYRAPELLLG ARHYTKAIDI WAIGCIFAEL LTSEPIFHCR QEDIKTSNPF 
HHDQLDRIFS VMGFPADKDW EDIRKMPEYP TLQKDFRRTT YANSSLIKYM EKHKVKPDSK 
VFLLLQKLLT MDPTKRITSE QALQDPYFQE DPLPTLDVFA GCQIPYPKRE FLNEDDPEEK 
GDKNQQQQQN QHQQPTAPPQ QAAAPPQAPP PQQNSTQTNG TAGGAGAGVG GTGAGLQHSQ 
DSSLNQVPPN KKPRLGPSGA NSGGPVMPSD YQHSSSRLNY QSSVQGSSQS OSTLGYSSSS 
QQSSQYHPSH QAHRY 

SEQ ID NO:79 PD05 DNA SEQUENCE 

Nucleic Acid Accession*: XMJM2922 

Coding sequence: 1-2190 (underlined sequences correspond to start and stop codons) 



AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG C 

T GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240 
:ttcagc AGCCTCTGTT ATTTTACTCC CATCCTGGGA 300 
G GTTGGGAAAA TTCAAGACAA TCATCTATCT CTCCTTGGTG 360 
G GCCATGTGAT CAAGTCCTTG GGTGCCTTAC CAATACTGGG AGGACAAGTG 420 
GTACACACAG TCCTATCATT GATCGGCCTG AGTCTAATAG CTTTGGGGAC AGGAGGCATC 480 
AAACCCTGTG TGGCAGCTTT TGGTGGAGAC CAGTTTGAAG AAAAACATGC AGAGGAACGG 540 
ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG G 
ATCACACCCA TGCTGAGAGG AGATGTGCAA TGTTTTGGAG AAGACTGCTA Tl 
TTTGGAGTTC CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 
ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC AAGTTTTCAA ATGTATCTGG 
TTTGCTATTT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 
CTAGACTGGG CAGCTGAGAA ATATCCAAAG CAGCTCATTA TGGATGTAAA GGCACTGACC 
AGGGTACTAT TCCTTTATAT CCCATTGCCC A 
TCACGATGGA CTTTGCAAGC CATCAGGATG A 



C CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 1260 
CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATG AAAACAATTC TCTGTTGATA 1320 
GAGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 1380 
AGCCAGGATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 
GTGCAGGAGA AGAACTGGTA CAGTCTTGTC ATTCGTGAAG ATGGGAACAG TATCTCCAGC 
ATGATGGTAA AGGATACAGA ARGCAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 
AACACTTTGC ATAAAGATGT CAACATCTCC CTGAGTACAG ATACCTCTCT CAATGTTGGT 
GAAGACTATG GTGTGTCTGC TTATAGAACT GTGCAAAGAG GAGAATACCC TGCAGTGCAC 
TGTAGAACAG AAGATAAGAA CTTTTCTCTG AATTTGGGTC TTCTAGACTT TGGTGCAGCA 1740 
TATCTGTTTG TTATTACTAA TAACACCAAT CAGGGTCTTC AGGCCTGGAA GATTGAAGAC 
ATTCCAGCCA ACAAAATGTC CATTGCGTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 
GGGGAGGTCA TGTTCTCTGT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 
ATGAAATCTG TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGGAA TATCATCGTG 
CTTGTTGTGG CACAGTTCAG TGGCCTGGTA CAGTGGGCCG AATTCATTTT GTTTTCCTGC 2040 
CTCCTGCTGG TGATCTGCCT GATCTTCTCC ATCATGGGCT ACTACTATGT TCCTGTAAAG 2100 
ACAGAGGATA TGCGGGGTCC AGCAGATAAG CACATTCCTC ACATCCAGGG GAACATGATC 2160 



11 21 31 41 51 

I I I I I 

E EVPPRPPSPP KKPSPTICGS KYPLSIAFIV VNEFCERFSY 
D TSTSIYHAFS SLCYFTPILG AAIADSWLGK FKTIIYLSLV 
YVLGHVIKSL GALPILGGQV VHTVLSLIGL SLTALGTGGI KPCVAAFGGD QFEEKHAEER 
TRYFSVFYLS INAGSLZSTF ITPKLRGDVQ CFGEDCYAIA FGVPGLtMVI ALWFAMGSK 
IYNKPPPEGN IVAQVFKCIW FAISNRFKNR SGDIPKRQHW LDWAAEKYPK QLIMDVKALT 
RVLFLYIPLF MFWALLDQQG SRWTLQAIRM NRNLGFFVLQ PDQMQVLNPF WLIFIPLFD 
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VQEKNWYSLV IREDGNSISS MMVKDTESKT TNGMTTVRPV NTLHKDVNIE LSTDTSLNVG 
EDYGVSAYRT VQRGEYPAVH CRTEDKNFSL NLGLLDFGAA YLFVITNNTN QGLQAWKIED 
IPANKMSIAW QLPQYALVTA GEVKFSVTGL EFSYSQAPSS MKSVLQAAWL LTIAVGNIIV 



SEQ ID N0:81 PD06 DNA SEQUENCE 

Nucleic Acid Accession #: NM_020448 

Coding sequence: 1 -1 221 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

AK3SACGGAT CCCACAGCGO AGCCCTGAAG CTGCAGCAGC TGCCTCCCAC AAGTAGCTCC 
AGCGCCGTAA GCGAGGCCTC CTTCTCCTAC AAGGAAAACC TGATTGGCGC CCTCTTGGCG 
ATCTTCGGGC ACCTCGTGGT CAGCATTGCA CTTAACCTCC AGAAGTACTG CCACATCCGC 
CTGGCAGGCT CCAAGGATCC CCGGGCCTAT TTCAAGACCA AGACATGGTG GCTGGGCCTG 
TTCCTGATGC TTCTGGGCGA GCTGGGTGTG TTCGCCTCCT ACGCCTTCGC GCCGCTGTCA 
CTCATCGTGC CCCTCAGCGC AGTTTCTGTG. ATAGCTAGTG CCATCATAGG AATCATATTC 
ATCAAGGAAA AGTGGAAACC GAAAGACTTT CTGAGGCGCT ACGTCTTGTC CTTTGTTGGC 
TGCGGTTTGG CTGTCGTGGG TACCTACCTG CTGGTGACAT TCGCACCCAA CAGTCACGAG 
AAGATGACAG GCGAGAATGT CACCAGGCAC CTCGTGAGCT GGCCTTTCCT TTTGTACATG 
CTGGTGGAGA TCATTCTGTT CTGCTTGCTG CTCTACTTCT ACAAGGAGAA GAACGCCAAC 
AACATTGTCG TGATTCTTCT CTTGGTGGCG TTACTTGGCT CCATGACAGT GGTGACAGTC 
AAGGCCGTGG CTGGGATGCT TGTCTTGTCC ATTCAAGGGA ACCTGCAGCT TGACTACCCC 
ATCTTCTACG TGATGOTCGT GTGCATGGTG GCAACCGCCG TCTATCAGGC TGCGTTTTTG 
AGTCAAGCCT CACAGATGTA CGACTCCTCT TTGATTGCCA GTGTGGGCTA CATTCTGTCC 
ACAACCATTG CTATCACAGG AGGTGCAATA TTTTACCTGG ACTTCATCGG GGAGGACGTG 
CTGCACATCT GCATGTTTGC ACTGGGGTGC CTCATTGCAT TCTTGGGCGT CTTCTTAATC 



GGTATCCAGA ACATGCACGA TAAAGGGATG ACTGTCCAGC CTGAACTTAA AGCTTCTTTT 1080 

TCCTATGGGG CTCTGGAAAA CAATGACAAC ATTTCTGAGA TCTACGCTCC TGCCACCCTG 1140 

CCAGTCATGC AAGAAGAGCA CGGCTCCAGA AGTGCCTCTG GGGTCCCCTA CCGAGTCCTA 1200 
GAGCACACCA AGAAGGA ATG A 



1 11 21 31 41 51 

I I I I I I 

MDGSHSAALK LQQLPPTSSS SAVSEASFSY KENLIGALLA IFGHLWSIA LNLQKYCHIR 
LAGSKDPRAY FKTKTWWLGL FLMLLGE LGV FASYAFAPLS LIVPLSAVSV I. 
IKEKWKPKDF LRRYVLSFVG CGLAWGTYL LVTFAPNSHE KMTGENUTRH L' 
LVEIILFCLL LYFYKEKNAN NIWILLLVA LLGSMTWT7 KAVAGMLVLS IQGNtQLDYP 
IFYVMFVCMV ATAVYQAAFL SQASQMYDSS LIASVGYILS TTIAITAGAI FYLDFIGEDV 
LHICMFALGC LIAFLGVFLI TRNHKKPIPF EPYISMDAMP GMQNMHDKGM T 



SEQ ID NO:83 PD08 DNA SEQUENCE 

Nucleic Acid Accession*: NM.032712 

Coding sequence: 555-908 (underlined sequences correspond to start and slop codons) 
1 11 21 31 41 51 

I I I I I I 

CACTCATTAA GAACAGAGGA GGCTGCCTGT TACTCCTGGT GTTGCATCCC TCCAGACACT 
CTGCTGTTTC CTGCCTAGGC GTGGCTGCAG CCATGGCTAG GAAAGCGCTG CCACCCACCC 
ACCTGGGCCA GAGCTGGTTC TGCTCCTGCT GCAGGGACAC TGAGCTGGCT ATCTCGGCGC 
TTCGGGCAAG AACTGCAACA GGCTCTCCTG GGTCCTGCAG GTGTACAGCC GGGCCCCTGC 
CTTGTGCCTC AGCTCTCGAG AGCTGCTGCT GCCGGGTGAC CTGATCCAAC CTGATAAGGT 
GCCATC TTCA GCTACCACTG CAAGGCCCTG AGGGCAACAG CAGCACGGCA CTGCCCACCC 
GGCTGCTGAT GGCCTGGTGC CAGCTGGGAG TCCTCCCGGC ACTTCGAGGC CACTGAGCCA 
CCCTTCCAGC CCCAGCCCAC CATGGACAGG GGTATCCAGC TTCCTCCTCA ACCTCGTCCT 
CTGCCCCTGA GCCAGTGACG CCCAAGGACA TGCCTGTTAC CCAGGTGCTG TACCAGCACT 
AGCTGGTCAA GGGCATGACA GTGCTGGAGG CCGTCTTGGA GATCCAGGCC ATCACTGGCA 
GCAGGCTGCT CTCCATGGTG CCAGGGCCCG CCAGGCCACC AGGCTCATGC TGGGACCCAA 
CCCAGTGCAC AAGGACTTGG CTGCTGAGCC ACACACCCAG GAGAAGGTGG ATAAGTGGGC 



C GTGAGACCTG GGCCTGGCTC TCAAGGACAG 840 

ACACCGCCTC GCCTGGTGCT CCAGGGGTGA AGCAGGCCAG AATCCTGGGG GAGCTGCTCC 900 

TGGTTTGAGC TGCATTCAGG AAGTGCGGGA CATGGTAGGG GAGGCAAAAA GCCTTGGGCA 960 

CTACCCTCCC TGTGGAGCTG 1TCGGTGTCC GTCGAGCTAG CCACACCCTG ACACCATGTT 1020 

CAAGGGTACC GGAAGAGAAG GGTGTCTGCC CCCAACCTCC CCTGTGGGTG TCACTGGCCA 1080 

GATGTCATGA GGGAAGCAGG CCTTGTGAGT GGACACTCAC CATGAGTCCC TGGGGGGAGT 1140 

GATCCCCCAG GCATCGTGTG CCATGTTGCA CTTCTGCCCA GGCAGCAGGG TGGGTGGGTA 1200 

CCATGGGTGC CCACCCCTCC ACCACATGGG GCCCCAAAGC ACTGCAGGCC AAGCAGGGCA 1260 
ACCCCACACC CTTGACATAA AAGCATCTTG AAGCTTTTAA AAAAAAAAAA AAAAAA 
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SEQ ID NO:85 PDT1 DNA SEQUENCE 

Nucleic Acid Accession*: NMJW0693 

Coding sequence: 53-1591 (underlined sequences correspond to start and stop codons) 



CGCTAACGGG GCCGTGGAAA ACGGGCAGCC GGACGGGAAG C 



CATCCGCAAC CTGGAGGTCA AGTTCACCAA GATATTTATC AACAATGAAT GGCACGAATC 
CAAGAGTGGG AAAAAGTTTG CTACATGTAA CCCTTCAACT CGGGAGCAAA TATGTGAAGT 
GGAAGAAGGA GATAAGCCCG ACGTGGACAA GGCTGTGGAG GCTGCACAGG TTGCCTTCCA 
GAGGGGCTCG CCATGGCGCC GGCTGGATGC CCTGAGTCGT GGGCGGCTGC TGCACCAGCT 
GGCTGACCTG GTGGAGAGGG ACCGCGCCAC CTTGGCCGCC CTGGAGACGA TGGATACAGG 
GAAGCCATTT CTTCATGCTT TTTTCATCGA CCTGGAGGGC TGTATTAGAA CCCTCAGATA 



AACTTCCG 600 



GAAGCCTGCG G 



CGGGTTCCCT CCAGGAGTGG TGAACATTGT GCCAGGATTC GGGCCCACAG TGGGAGCAGC 780 



ACTGGTTAAA GAAGCTGCGT CCCGGAGCAA TCTGAAGCGG GTGACGCTG 



r GTGCGGACGC TGACTIGGAC TTGGCAGTGG AGTGTGCCCA 960 
TCAGGGAGTG TTCTTCAACC AAGGCCAGTG TTGCACGGCA GCCTCCAGGG TGTTCGTGGA 1020 
GGAGCAGGTC TACTCTGAGT TTGTCAGGCG GAGCGTGGAG TATGCCAAGA AACGGCCCGT 1080 
\ AAACAGAACA GGGGCCTCAG ATTGATCAAA AGCAGTTCGA 1140 
1200 
1260 
1320 



GTTCACAAAA AATCTCGACA AAGCCCTGAA GTTGGCTTCT GCCTTAGAGT CTGGAACGGT 1440 

ICTATGC ACAGGCTCCA TTTGGTGGCT TTAAAATGTC 1500 

3 GTGAATACGC TTTGGCCGAA TACACAGAAG TGAAAACTGT 1560 

A CTTGGCGACA AGAACCCC TG A AGGAAAGGG GGGGCTCCTT CCTCAAACAT 1520 

CGGACGGCGG AATGTGGCAG ATGAAATGTG CTGGAGGAAA AAAATGACAT TTCTGACCTT 1680 

CCCGGGACAC ATTCTTCTGG AGGCTTTACA TCTACTGGAG TTGAATGATT GCTGTTTTCC 1740 

TCTCACTCTC CTGTTTATTC ACCAGACTGG GGATGCCTAT AGGTTGTCTG TGAAATCGCA 1800 

GTCCTGCCTG GGGAGGGAGC TGTTGGCCAT TTCTGTGTTT CCCTTTAAAC CAGATCCTGG 1860 

AGACAGTGAG ATACTCAGGG CGTTGTTAAC AGGGAGTGGT ATTTGAAGTG TCCAGCAGTT 1920 

GCTTGAAATG CTTTGCCGAA TCTGACTCCA GTAAGAATGT GGGAAAACCC CCTGTGTGTT 1980 
CTGCAAGCAG GGCTCTTGCA CCAGCGGTCT CCTCAGG 
CACGCCTCTT TCCGAGGTGA AGGTGGGACC ATTCCTTGGG A 

TTTTGGTTTT TGTTTTTTGT TTTCTTCTTT TTAAAAAAAG GATTTCACAG TGAGAAAGTT 2160 

TTGGTTAGTG CATACCGTGG AAGGGCGCCA GGGTCTTTGT GGATTGCATG TTGACATTGA 2220 

CCGTGAGATT CGGCTTCAAA CCAATACTGC CTTTGGAATA TGACAGAATC AATAGCCCAG 2280 

AGAGCTTAGT CAAAGACGAT ATCACGGTCT ACCTTAACCA AGGCACTTTC OTAAGCAGAA 2340 

AATATTGITG AGGTTACCTT TGCTGCTAAA GATCCAATCT TCTAACGCCA CAACAGCATA 2400 

GCAAATCCTA GGATAATTCA CCTCCTCATT TGACAAATCA GAGCTGTAAT TCACTTTAAC 2460 

AAATTACGCA TTTCTATCAC GTTCACTAAC AGCTTATGAT AAGTCTGTGT AGTCTTCCTT 2520 

TTCTCCAGTT ■ CTGTTACCCA ATTTAGATTA GTAAAGCGTA CACAACTGGA AAGACTGCTG 2580 

TAATAACACA GCCTTGTTAT TTTTAAGTCC TATTTTGATA TTAATTTCTG ATTAGTTAGT 2640 

AAATAACACC TGGATTCTAT GGAGGACCTC GGTCTTCATC CAAGTGGCCT GAGTATTTCA 2700 

CTGGCAGGTT GTGAATTTTT CTTTTCCTCT TTGGGAATCC AAATGATGAT GTGCAATTTC 2760 

ATGTTTTAAC TTGGGAAACT GAAAGTGTTC CCATATAGCT TCAAAAACAA AAACAAATGT 2820 

GTTATCCGAC GGATACTTTT ATGGTTACTA ACTAGTACTT TCCTAATTGG GAAAGTAGTG 2880 

CTTAAGTTTG CAAATTAAGT TGGGGAGGGC AATAATAAAA TGAGGGCCCG TAACAGAACC 2940 

AGTGTGTGTA TAACGAAAAC CATGTATAAA ATGGGCCTAT CACCCTTGTC AGAGATATAA 3 000 

ATTACCACAT TTGGCTTCCC TTCATCAGCT AACACTTATC ACTTATACTA CCAATAACTT 3 060 

GTTAAATCAG GATTTGGCTT CATACACTGA ATTTTCAGTA TTTTATCTCA AGTAGATATA 3120 

GACACTAACC TTGATAGTGA TACGTTAGAG GGTTCCTATT CTTCCATTGT ACGATAATGT 3180 

CTTTAATATG AAATGCTACA TTATTTATAA TTGGTAGAGT TATTGTATGT TTTTATAGTT 3240 

GTAAGTACAC AGAGGTGGTA TATTTAAACT TCTGTAATAT ACTGTATTTA GAAATGGAAA 3300 

TATATATAGT GTTAGGTTTC ACTTCTTTTA AGGTTTACCC CTGTGGTGTG GTTTAAAAAT 3360 

CTATAGGCCT GGGAATTCCG ATCCTAGCTG CAGATCGCAT CCCACAATGC GAGAATGATA 3420 
AAATAAAATT GGATATTTGA GA 

SEP ID NO:86 PDT1 PROTEIN SEQUENCE 



N GQPDGKPPAL PRPIRNI*EVK FTKIFINNEW H 
CEVEEGDKED VDKAVEAAQV APQRGSPWRR LDALSRGRLL HQLADLVERD R 
DTGKPFLHAF FIDLEGCIRT LRYFAGWADK IOGKTIPTDD NWCFTRHEP IGVCGAITPW 
NFPLLMLVWK LAPALCCGNT HVLKPAEQTP LTALYLGSLI KEAGFPPGW NIVPGFGPTV 
GAAISSHPQI NKIAFTGSTE VGKLVKEAAS RSNLKRVTLE LGGKNPCIVC ADADUJLAVE 
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CAHQGVFFNQ GQCCTAASRV FVEEQVYSEF VRRSVEYAKK RPVGDPFDVK TEQGPQIDQK 
QFDKILELIE SGKKEGAKLE CGGSAMEDKG LFIKPTVFSE VTDNMRIAKE EIFGPVQPIL 
KFKSIEEVIK RANSTDYGLT AAVFTKNLDK ALKLASALES GTVWIKCYNA LYAQAPFGGF 
KMSGNGRELG EYALAEYTEV KTVTIKLGDK NP 

SEQ ID NO:87 PDV3 DNA SEQUENCE 

Nucleic Acid Accession*: NM_032642 

Coding sequence: 184-1 263 (underlined sequences correspond to start and stop codons) 

ii tv r r 

GACCATTAGC AGGCACCCAG GCCTGTCTTT GGCTCGGAAA CGGTGGCCCC CAATGTAGCC 
TAGTTTGAAC CTAGGAACTG CAGGACCAGA GAGATTCCAC TGGAGCCTGA TGGACGGGTG 
ACAGAGGGAA CCCTACTCTG GAAACTGTCA GTCCCAGGGC ACTGGGGAGG GCTGAGGCCG 
ACCATGCCCA GCCTGCTGCT GCTGTTCACG GCTGCTCTGC TGTCCAGCTG GGCTCAGCTT 
CTGACAGACG CCAACTCCTG GTGGTCATTA GCTTTGAACC CGGTGCAGAG ACCCGAGATG 
TTTATCATCG GTGCCCAGCC CGTGTGCAGT CAGCTTCCCG GGCTCTCCCC TGGCCAGAGG 
AAGCTGTGCC AATTGTACCA GGAGCACATG GCCTACATAG GGGAGGGAGC CAAGACTGGC 
ATCAAGGAAT GCCAGCAOCA GTTCCGGCAG CGGCGGTGGA ATTGCAGCAC AGCGGACAAC 
GCATCTGTCT TTGGGAGAGT CATGCAGATA GGCAGCCGAG AGACCGCCTT CACCCACGCG 
GTGAGCGCCG CGGGCGTGGT CAACGCCATO AGCCGGGCCT GCCGCGAGGG CGAGCTCTCC 
ACCTGCGGCT GCAGCCGGAC GGCGCGGCCC AAGGACCTGC CCCGGGACTG GCTGTGGGGC 
GGCTGTGGGG ACAACGTGGA GTACGGCTAC CGCTTCGCCA AGGAGTTTGT GGATGCCCGG 
GAGCGAGAGA AGAACTTTGC CAAAGGATCA GAGGAGCAGG GCCGGGTGCT CATGAACCTG 
CAAAACAACG AGGCCGGTCG CAGGGCTGTG TATAAGATGG CAGACGTAGC CTGCAAATGC 
CACGGCGTCT CGGGGTCCTG CAGCCTCAAG ACCTGCTGGC TGCAGCTGGC CGAGTTCCGC 
AAGGTCGGGG ACCGGCTGAA GGAGAAGTAC GACAGCGCGG CCGCCATGCG CGTCACCCGC 
AAGGGCCGGC TGGAGCTGGT CAACAGCCGC TTCACCCAGC CCACCCCGGA GGACCTGGTC 



TGGTGCTGCT TCGTCAGGTG TAAGAAGTGC ACGGAGATCG TGGACCAGTA CATCTGTAAA 
TAGCCCGGAG GGCCTGCTCC CGGCCCCCCC TCCACTCTGC CTCACAAAGG TCTATATTAT 
ATAAA TCTAT ATAAATCTAT TTTATATTTG TATAAGTAAA TGGGTGGGTG CTATACAATG 
GAAAGATGAA AATGGAAAGG AAGAGCTTAT TTAAGAGACG CTGGAGATCT CTGAGGAGTG 




CCCTAAACTG AATGTTTGCG CCTGGGCTGC AGAAGCCAGG GTGCATGACC AGGCTGCGTG 2040 

GACGTTATAC TGTCTTCCCC CACCCCCGGG GAGGGGAAGC TTGAGCTGCT GCTGTCACTC 2100 

CICCACCGAG GGAGGCCTCA CAAACCACAG GACGCTGCAA CGGGTCAGGC TGGCGGGCCC 2160 

GGCGTGCTCA TCATCTCTGC CCCAGGTGTA CGGTTTCTCT CTGACATTAA ATGCCCTTCA 2220 
TGGAAAAAAA AAAAAGAAAA AAAAAAAAAA AA 



11 21 31 41 51 

I I I I I 

A ALLSSWAQLL TDANSWWSLA LHPVQRPEMF IIGAQPVCSQ LPGLSFGQRK 
LCQLYQEHMA YIGEGAKTGI KECQHQFRQR RWNCSTADNA SVFGRVMQIG SRETAFTHAV 
SAAGWNAIS RACREGELST CGCSRTARPK DLPRDWLWGG CGDNVEYGYR FAKEFVDARE 
REKNFAKGSE EQGRVLMNLQ NNEAGRRAVY KMADVACKCH GVSGSCSLKT CWLQLAEFRK 
OU VGDRLKEKYD SAAAMRVTRK GRLELVNSRF TQPTPEDLVY VDPSPDYCLR NEETGSLGTQ 
GRLCNKTSEG MDGCELMCCG RGYNQFKSVO VERCHCKFHW CCFVRCKKCT EIVDQYICK- 

SEQ ID NO:89 PDT9 DNA SEQUENCE 

Nucleic Acid Accession*: NM_O33280 
65 Coding sequence: 58-636 (undefined sequerrces correspond to start and slop codons) 

1 11 21 31 41 51 

I I I I I I 

GGCAGCCGTC TGTGCCACCC AGAGCCGGCG GGCCGCTAGG TCCCCGGAGA CCCTGCTATG 

70 GTGCGTGCGG GCGCCGTGGG GGCTCATCTC CCCGCGTCCG GCTTGGATAT CTTCGGGGAC 
CTGAAGAAGA TGAACAAGCG CCAGCTCTAT TACCAGGTTT TAAACTTCGC CATGATCGTG 
TCTTCTGCAC TCATGATATG GAAAGGCTTG ATCGTGCTCA CAGGCAGTGA GAGCCCCATC 
GTGGTGGTGC TGAGTGGCAG TATGGAGCCG GCCTTTCACA GAGGAGACCT CCTGTTCCTC 

_ _ ACAAATTTCC GGGAAGACCC AATCAGAGCT GGTGAAATAG TTGTTTTTAA AGTTGAAGGA 

75 CGAGACATTC CAATAGTTCA CAGAGTAATC AAAGTTCATG AAAAAGATAA TGGAGACATC 
AAATTTCTGA CTAAAGGAGA TAATAATGAA GTTGATGATA GAGGCTTGTA CAAAGAAGGC 
CAGAACTCK3C TGGAAAAGAA GGACGTGGTG GGAAGAGCAA GAGGGTTTTT ACCATATGTT 
GGTATGGTCA CCATBATAAT GAATGACTAT CCAAAATTGA AGTATGCTCT TTTGGCTGTA 
- ATGGGTGCAT ATGTGTTACT AAAACGTGAA TCCTAAAATG AGAAGCAGTT CCTGGGACCA 

80 GATTGAAATG AATTCTGTTG AAAAAGAGAA AAACTAATAT ATTTGAGATG TTCCATTTTC 
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VTTCACCAGT 780 



MVRAGAVGAH LPASGLDIFG DLKKMNKRQL YYQVLNFAHI VSSALMIHKG LIVLTGSESP 
IVWLSGSMB PAFHRGDLLF LTHFREDPIR AGE IWFKVE GRDIPIVHRV IKVHEKDNGD 
IKPLTKGDNN EVDDRGLYKE GQNWLEKKDV VGRARGFLPY VGMVTIIMND YPKFKYALLA 
VMGAWLLKR ES 

SEQIDNO:91 PDV5DNA SEQUENCE 

Nucleic Acid Accession #: NM_0 1 6590 

Coding sequence: 691-975 (underlined sequences correspond to start and stop codons) 



CGTGTCAGAA CTCAATTACG ACTACATATG CATTAAGGCA GGAACTGGCA GGCCTCAGGG 120 
TACGCCAACT ATAGGACTCG TGCTTCTCGT ACGCTGGGCT ATAATCTATG AAACTGAGCT 180 
CCAGAGCCAG CCAATCACTT AGCTCCTCAT AACAAGTCTA ACTGGCTCTG GAAAGCTGAA 240 
AGGGCTGCAC TGGAACAACA CAGATGAGAT ATTCTACACA TTAATCTACT TATCTGGAAT 300 

25 CACTTTGCCT CTAAAGGCCA GAGAAAAATC ACAGCTTCCT TGTCGGAGGG GAAAAGGACA 360 
GGTGATCTGG GGAAAACGCA GCTACACCTG GAGCAAGGTC TCTTCCCGGC TTGGCAATCT 420 
CAGCTGTGCC GGCGCTACGG GACCCGAGCC GTCCCAGAAA CCAAAGGGCA GGCACGGCAG 480 
CAAACGCCTG AGTGCTGCTG CCTTCGGTGA CTATATGAGA ATGGAAACTT CTAAGGAAGC 540 
CAGGTTGTTA GAATTGTTAC CCCCTTTACT CAGAGATAAC ATAGATTATC CAGGCTGAGA 600 

3U TGGAAAACAA GCCCTTTATT GAATTTTCAA CACAGACTCC CTGCTTCTCA TCTCCTTAAT 660 
AAAATTTCAT TAAAATCCCC TTGAACTCCC ATGTTCAAAT CTCCATTTGT TGACAGACAA 720 
AGCCAACAAT ACTCTAAACT GAGGCCTGCA AGTCATTTCA TTTGTATTTT TGTCCAGAAA 7B0 
TTTCCCATAG GAAGACTTCA CCTCCTACAA CTCCGAAGAA AACCCTTACT GTCCAAGACC 840 
GTCACCAGCA ACCATCCGCA GTCATTCAAG TGGAAGCTTT CACAGCTTTT GTACATTCTC 900 

35 TGTGTCAATA TACAACTGAG TTACAGACTG TCCCCTGGCT CCCTGACCCT TACAAACACT 960 
AAAAGTTTTG TTTGACTCAA CTTCAAGCTG CTCATCTGTT AGTAAGTGAT GTTCACTCCA 
GAACACATTC ATGATGAGAA CTTTCTAAAA GACCAGCACT GCTCTTCCCC TCCTATAATC 
ATAATAATCA TGATAACCTG AAACATGTTA CTGGGACTCG ACATTTTTCT GGGGATTGAA 
. ATCTTTAGTC CTTGGAGCTG TCACATAGCA GGGGCAACCT CACACTGAAA CAAAGGAAGT 

41) GATGTCCCAT TATTATCCAC CCTGAGCCAC CATAATATGC TGTTTACATT TATTTTCTTC 
AGCCTGTGCA AAACAAAGCA ATGGAAAAGG AAACTAAAAA ATATACJ 
ATCTTCTTTT GCCTAAAATT ACTAATGCAC CACGTCAGTC TGCTTCCTTC A 
CTCAATTCAT CAGGACTIGT ATTAGCAGGT TCTGGCTAGA GAGACTATCT CCTGTCATCA 
. _ CGATCAATTA ATGTTTTCTG GTGATCACAT CAGGCCCTAT CTAAGAAGCT CATGGTATAC 

45 AAGGGTCACC CAAATAGCTG AGTGCAGTCC TTGCTCATAT TTCCTTCATC TTAACCCCGC 
AAACAAGAAT TAAGATGATC CCAATAAAAG AAAAATTGCT CAGGAAACTG AACCTTTTTC 
TGAACCAAGC ACTGTCAGCA AATCTCAGGT ATTAGAGCAA CTATGGTTGA TTGAAAAGTG 
TCTCAAAATC TGGGCCAAGA ATGATTGCTA GGTCCATAAG CTAATTTGTC TGGCCTTGCC 
ATTTACGTAA GCCAAAGAAA GTCACTCATG AGTAAACTAT AGAAAACGTT CAGACCCATC 

50 CTGTTAGTAT GTCAAATCAA OTAAGACTGG CAGGGTATTA ACTCCATTCC AGGTGACATG 
GATAAAGAGC CCCATTMTT TCACAGTGCC AGCCTCTACC TAAGGAAACC CTAGACCTTG 
GAACCAGTTT CCTGGTAGGG AACTGCTGAC AGTTTCAATG CTGACAGTTG GAGCCAATGC 
CTCATAGTGT AAACTGAAAG AAAAATAGTT GCTTTTTAAA ATGTCAGCAA GAAGGCCTGC 2040 

c CTCATCTTAA CAAAGCAAAA AAAAATGCTT TAATTCAAAT TAAAAATCAT 



SEQ ID N0:93 PEE6 DNA SEQUENCE 

Nucleic Acid Accession*: NM_002606 

Coding sequence: 61-1842 (underlined sequences correspond to start and stop codons) 

11 21 31 41 51 

CGCGGCGGCT GGCGTCGGGA AAGTACAGTA AAAAGTCCGA GTGCAGCCGC CGGGCGCAGG 
ATGGGATCCG GCTCCTCCAG CTACCGGCCC AAGGCCATCT ACCTGGACAT CGATGGACGC 
ATTCAGAAGG TAATCTTCAG CAAGTACTGC AACTCCAGCG ACATCATGGA CCTGTTCTGC 
ATCGCCACCG GCCTGCCTCG GAACACGACC ATCTCCCTGC TGACCACCGA CGACGCCATG 
GTCTCCATCG ACCCCACCAT GCCCGCGAAT TCAGAACGCA CTCCGTACAA AGTGAGACCT 
GK3GCCATCA AGCAACTCTC CGCTGGTGTC GAGGACAAGA GAACCACAAG CCGTGGCCAG 
TCTGCTGAGA GACCACTGAG GGACAGACGG GTTGTGGGCC TGGAGCAGCC CCGGAGGGAA 
GGAGCATTTG AAAGTGGACA GGTAGAGCCC AGGCCCAGAG AGCCCCAGGG CTGCTACCAG 
GAAGGCCAGC GCATCCCTCC AGAGAGAGAA GAATTAATCC AGAGCGTGCT GGCGCAGGTT 
GCAGAGCAGT TCTCAAGAGC ATTCAAAATC AATGAACTGA AAGCTGAAGT TGCAAATCAC 
TTGGCTGTCC TAGAGAAACG CGTGGAATTG GAAGGACTAA AAGTCGTGGA GATTGAGAAA' 
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25 
30 
35 
40 



TGCAAGAGTG ACATTAAGAA GATGAGGGAG GAGCTGGCGG CCAGAAGCAG CAGGACCAAC 
TGCCCCTGTA AGTACAGTTT TTTGGATAAC CACAAGAAGT TGACTCCTCG ACGCGATGTT 
CCCACTTACC CCAAGTACCT GCTCTCTCCA GAGACCATCG AGGCCCTGCG GAAGCCGACC 
TTTGACGTCT GGCTTTGGGA GCCCAATGAG ATGCTGAGCT GCCTGGAGCA CATGTACCAC 
GACCTCGGGC TGGTCAGGGA CTTCAGCATC AACCCTGTCA CCCTCAGGAG GTGGCTGTTC 
TGTGTCCACG ACAACTACAG AAACAACCCC TTCCACAACT TCCGGCACTG CTTCTGCGTG 
GCCCAGATGA TGTACAGCAT GGTCIGGCTC TGCAGTCTCC AGGAGAAGTT CTCACAAACG 
GATATCCTGA TCCTAATGAC AGCGGCCATC TGCCACGATC TGGACCATCC CGGCTACAAC 
AACACGTACC AGATCAATGC CCGCACAGAG CTGGCGGTCC GCTACAATGA CATCTCACCG 
CTGGAGAACC ACCACTGCGC CGTGGCCTTC CAGATCCTCG CCGAGCCTGA GTGCAACATC 
TTCTCCAACA TCCCACCTGA TGGGTTCAAG CAGATCCGAC AGGGAATGAT CACATTAATC 
TTGGCCACTG ACATGGCAAG ACATGCAGAA ATTATGGATT CTTTCAAAGA GAAAATGGAG 
AATTTTGACT ACAGCAACGA GGAGCACATG ACCCTGCTGA AGATGATTTT GATAAAATGC 
TGTGATATCT CTAACGAGGT CCGTCCAATG GAAGTCGCAG AGCCTTGGGT GGACTGTTTA : 
TTAGAGGAAT ATTTTATGCA GAGCGACCGT GAGAAGTCAG AAGGCCTTCC TGTGGCACCG 
TTCATGGACC GAGACAAAGT GACCAAGGCC ACAGCCCAGA TTGGGTTCAT CAAGTTTGTC 
CTGATCCCAA TGTTTGAAAC AGTGACCAAG CTCTTCCCCA TGGTTGAGGA GATCATGCTG 
CAGCCACTTT GGGAATCCCG AGATCGCTAC GAGGAGCTGA AGCGGATAGA TGACGCCATG 
AAAGAGTTAC AGAAGAAGAC TGACAGCTTG ACGTCTGGGG CCACCGAGAA GTCCAGAGAG 
AGAAGCAGAG ATGTCAAAAA CAGTGAAGGA GACTGTGCC T GA GGAAAGCG GGGGGCGTGG 
CTGCAGTTCT GGACGGGCTC GCCGAGCTGC GCGGGATCCT IGTGCAGGGA AGAGCTGCCC 
TGGGCACCTG GCACCACAAG ACCATGTTTT CTAAGAACCA TTTTGTTCAC TGATACAAAA : 



I 11 21 31 41 51 

II I I I I 

MGSGSSSYRP KAIYLDIDGR IQKVIFSKYC NSSDIMDLFC IATGLPF 

VSIDPTMPAN SERTPYKVRP VAIKQLSAGV EDKRTTSRGQ SAERPLRDRR WGLEQPRRE 
GAFESGQVEP RPREPQGCYQ EGQRIPPERE ELIQSVLAQV AEQFSRAFKI K 
LAVLEKRVEL EGLKWEIEK CKSDIKKMRE ELAARSSRTN CPCKYSJ 
PTYPKYLLSP ETIEALRKPT FDVWLWEPNE MLSCLEHMYH DLGLVRDFSI NFVTLRRWLF 
CVHDNYRNNP FHNFRHCFCV AQMMYSMVWL CSLQEKFSQT DILILHTAAI GHDLDHPGYN 
NTYQINARTE LAVRYNDISP LENHHCAVAF QILAEPECNI FSNIPPDGFK QIRQGMITLI 
LATDMARHAE IHDSFKEKME NFDYSNEEHM TLLKMILIKC CDISNE\ 
LEEYFMQSDR EKSEGLPVAP FMDRDKVTKA TAQIGFIKFV LIPMFETVTK LFPM 
QPLWESRDRY EELKRIDDAM KELQKKTDSL TSGATEKSRE RSRDVKNSEG DCA 

SEQ ID NO:95 PEG4 DNA SEQUENCE 



TGCCCTCCTG GGGGGCCCCA CCTGGGCAGG GAAGATGTAT GGCCCTGGAG GAGGCAAGTA 
TTTCAGCACC ACTGAAGACT ACGACCATGA AATCACAGGG CTGCGGGTGT CTGTAGGTCT 
TCTCCTGGTG AAAAGTGTCC AGGTGAAACT TOGAGACTCC TGGGACGTGA AACTGGGAGC 
CTTAGGTGGG AATACCCAGG AAGTCACCCT GCAGCCAGGC GAATACATCA CAAAAGTCTT 
TGTCGCCTTC CAAGCTTTCC TCCGGGGTAT GGTCATGTAC ACCAGCAAGG ACCGCTATTT 
PTGGG AAGCTTGATC GCCAGATCTC CTCTGCCTAC C 

GAATTATCCA CTAGAGGAGC CGACCACTGA GCCACCAGTT AATCTCACAT A 
CTCACCCGTG GGTCGCTAGG GTGGGGTATG GGGCCATCCG AGCTGAGGCC ATCTGTGTGG 600 
TGGTGGCTGA TGGTACTGGA GTAACTGAGT CGGGACGCTG AATCTGAATC CACCAATAAA 660 
TAAAGCTTCT Gl 




MLLLLTLALL GGPTWAGKMY GPGGGKYFST TEDYDHEITG LRVSVGLLLV KSVOVKLGDS 
WDVKLGALGG NTQEVTLQPG EYITKVFVAF QAFLRGMVMY TSKDRYFYFG KLDGQISSAY 
PSQEGQVLVG IYGQYQLLGI KSIGFEWNYP LEEPTTEPPV NLTYSANS PV GR 

SEQ ID NO: 97 PEL9 DNA SEQUENCE 

Nucleic Add Accession*: NMJ06953 

Coding sequence: 33-896(underiined sequences correspond to start and stop codons) 



CCGTTCCGCG CTCTCGCGGC TCCTCCCGGG CGATGCCTCC GCTCTGGGCC CTGCTGGCCC 
TCGGCTGCCT GCGGTTCGGC TCGGCTGTGA ACCTGCAGCC CCAACTCGCC AGTGTGACTT 
TCGCCACCAA CAACCCCACA CTTACCACTG TGGCCTTGGA AAAGCCTCTC TGCATGTTTG 



CCATTTCCAG GAATGCCTCA GTGCAAGACA GCACCAACAC CCCACTGGGC TCAACGTTCC 
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TACAAACAGA GGGTGGGAGG ACAGGTCCCT ACAAAGCTGT GGCCTTTGAC CTGATCCCCT 360 

GCAGTGACCT GCCCAGCCTG GATGCCATTG GGGATGTGTC CAAGGCCTCA CAGATCCTGA 420 

ATGCCTACCT GGTCAGGGTG GGTGCCAACG GGACCTGCCT GTGGGATCCC AACTTCCAGG 460 

GCCTCTGTAA CGCACCCCTG TCGGCAGCCA CGGAGTACAG GTTCAAGTAT GTCCTGGTCA 540 

ATATGTCCAC GGGCTTGGTA GAGGACCAGA CCCTGTGGTC GGACCCCATC CGCACCAACC 600 

AGCTCACCCC ATACTCGACG ATCGACACGT GGCCAGGCCG GCGGAGCGGA GGCATGATCG 660 

TCATCACTTC CATCCTGGGC TCCCTGCCCT TCTTTCTACT TGTGGGTTTT GCTGGCGCCA 720 

TTGCCCTCAG CCTCGTGGAC ATGGGGAGTT CTGATGGGGA AACGACTCAC GACTCCCAAA 780 

TCACTCAGGA GGCTGTTCCC AAGTCGCTGG GGGCCTCGGA GTCTTCCTAC ACGTCCGTGA 840 

ACCGGGGGCC GCCACTGGAC AGGGCTGAGG TGTATTCCAG CAAGCTCCAA GACTGAGCCC 900 

AGCACCACCC CTGGGCAGCA GCATCCTCCT CTCTGGCCTT GCCCCAGGCC CTGCAGCGGT 960 

GGTTGTCACA CCCTC3ACTTC AGGGAAGGTG AAACAGGGCT TGTCCCTCCA ACTGCAGGAA 1020 
AACCCTTAAT AAAATCTTCT GATGAGTTCT AAAAAAAAA 



1 11 21 31 41 51 

I I I I I I 

MPPLWALLAIi GCLRFGSAVN LQPQLASVTF ATNMPTLTTV ALEKPLCMFD SKEALTGTHE 
VYLYVLVDSA ISRNASVQDS TNTPLGSTFL QTEGGRTGPY KAVAFDLIPC SDLPSLDA1G 
DVSKASQILN AYLVRVGANG TCLWDPNFQG LCNAPLSAAT EYRFKYVLVK MSTGLVEDQT 
LWSDPIRTNQ LTPYSTIDTW PGRRSGGMIV ITSILGSLPF FLLVGFAGAI ALSIiVDMGSS 
DGETTHDSQ1 TQEAVPKSLG ASESSYTSVN RGPPLDRAEV YSSKLQD 



SEQ ID NO: 99 PEN1 DNA SEQUENCE 

correspond to start and stop codons) 

1 n 21 31 41 51 

I I I I I I 

GTCTGACTTC CTCCCAGCAC ATTCCTGCAC TCTGCCGTGT CCACACTGCC CCACAGACCC 60 

AGTCCTCCAA GCCTGCTGCC AGCTCCCTGC AAGCCCCTCA GGTTGGGCCT TGCCACGGTG 120 

CCAGCAGGCA GCCCTGGGCT GGGGGTAGGG GACTCCCTAC AGGCACGCAG CCCTCAGACC 180 

TCAGAGGGCC ACCCCTTGAG GGTGGCCAGG CCCCCAGTGG CCAACCTGAG TGCTGCCTCT 240 

GCCACCAGCC CTGCTGGCCC CTGGTTCCGC TGGCCCCCCA GATGCCTGGC TGAGACACGC 300 

CAGTGGCCTC AGCTGCCCAC ACCTCTTCCC GGCCCCTGAA GTTGGCACTG CAGCAGACAG 360 

CTCCCTGGGC ACCAGGCAGC TAACAGACAC AGCCGCCAGC CCAAACAGCA GCGGCATGGG 420 

CAGCGCCAGC CCGGGTCTGA GCAGCGTATC CCCCAGCCAC CTCCTGCTGC CCCCCGACAC 480 

GGTGTCGCGG ACAGGCTTGG AGAAGGCGGC AGCGGGGGCA GTGGGTCTCG AGAGACGGGA 540 

CTGGAGTCCC AGTCCACCCG CCACGCCCGA GCAGGGCCTG TCCGCCTTCT ACCTCTCCTA 600 



GGGCAGCCTG GACTTGGTGC CCGGCGGGCT GACCTTGGAG GAGCACTCGC TGGAGCAGGT 
45 GCAGTCCATG GTGGTGGGCG AAGTGCTCAA GGACATCGAG ACGGCCTGCA AGCTCCTCAA 
CATCACCGCA GATCCCATGG ACTGGAGCCC CAGCAATGTG CAGAAGTGGC TCCTGTGGAC 
AGAGCACCAA TACCGGCTGC CCCCCATCGG CAAGGCCTTC CAGGAGCTGG CGGGCAAGGA 
GCTGTGCGCC ATGTCGGAGG AGCAGTTCCG CCAGCGCTCG CCCCTGGGTG GGGATGTGCT 
GCACGCCCAC CTGGACATCT GGAAGTCAGC GGCCTGGATG AAAGAGCGGA CTTCACCTGG 
50 GGCGATTCAC TACTGTGCCT CGACCAGTGA GGAGAGCTGG ACCGACAGCG AGGTGGACTC 
ATCATGCTCC GGGCAGCCCA TCCACCTGTG GCAGTTCCTC AAGGAGTTGC TACTCAAGCC 
CCACAGCTAT GGCCGCTTCA TTAGGTGGCT CAACAAGGAG AAGGGCATCT TCAAAATTGA 
GGACTCAGCC CAGGTGGCCC GGCTGTGGGG CATCCGCAAG AACCGTCCCG CCATGAACTA 
_ _ CGACAAGCTG AGCCGCTCCA TCCGCCAGTA TTACAAGAAG GGCATCATCC GGAAGCCAGA 
55 CATCTCCCAG CGCCTCGTCT ACCAGTTCGT GCACCCCATC TGAGTGCCTG GCCCAGGGCC 
3 CCTGCCCTGC CTCAGCCAGG CCCTGAGATG 
C TGACCTTCCA GAGCCCAAGG TCAGGGAGGG 
GCAACCAACT GCCCCAGGGG GATATGGGTC CTCTGGGGCC TTCGGGACCA TGGGGCAGGG 
GTGCTTCCTC CTCAGGCCCA GCTGCTCCCC TGGAGGACAG AGGGAGACAG GGCTGCTCCC 
CAACACCTGC CTCTGACCCC AGCATTTCCA GAGCAGAGCC TACAGAAGGG CAGTGACTCG 
ACAAAGGCCA CAGGCAGTCC AGGCCTCTCT CTGCTCCATC CCCCTGCCTC CCATTCTGCA 
CCACACCTGG CATGGTGCAG GGAGACATCT GCACCCCTGA GTTGGGCAGC CAGGAGTGCC 
CCCGGGAATG GATAATAAAG ATACTAGAGA ACTG 



1 11 21 31 41 51 

I I I I I I 

MGSASPGLSS VSPSHDLLPP DTVS RTGLEK AAAGAVG LER RDWSPSPPAT PEQGLSAFYL 
SYFDMLYPED S SWAAKAPGA SSREEPPEEP EQCPVIDSQA PAGSLDLVEG GLTLEEHSLE 
QVQSMWGEV LKDIETACKL LNITADPMDW SPSKVQKWLL WTEHQYRLPP MGKAFQELAG 
KELCAMSEEQ FRQRSPLGGD VLHAHLDIWK SAAWMKERTS PGA1HYCAST SEESWTDSEV I 
DSSCSGQPIH LWQFLKELLL KPHSYGRF1R WLNKEKGIFK IEDSAQVARL WG1RKHRPAH 
NYDKLSRSIR QYYKKGIIRK PDISQRLVYQ FVHPI 

SEQ ID N0:101 PEN3 DNA SEQUENCE 
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I I 

GAGAGAACAG CGTGAGCCTG T 
GCTTGGGTTT CACCTGCAGA A 
CTGCATGAAG CCGTTCTGGC 1 
AGAGCTTGCC CAGCTG3 
GGGGTGTCTC CTAAACCCTC A 
GCrCTATTCT GTACCTG 

CCAAGCCAGG CTGGTTOTCT GCATCCTTTC AATGACCTGT TTTCTTCTGT AACCACAGGT 420 

TCGGTGGTGA GAGGAAGCCT CGCAGAATCC AGCAGAATCC TCACAGAATC CAGCAGCAGC 480 

TCTGCTGGGG ACATGGTCCA TGGTGCAACC CACAGCAAAG CCCTGACCTC ACCTCCTGAT 540 

GCTCAGGAGA AGCCATGGGC CCCTCCTGTC CTGTGTTCCT GTCCTTCACA AAGCTCAGCC 600 

TGTGGTGGCT CCTTCTGACC CCAGCAGGTG GAGAGGAAGC TAAGCGCCCA CCTCCCAGGG 660 

CTCCTGGAGA CCCACTCTCC TCTCCCAGTC CCACGGCATT GCCGCAGGGA GGCTCGCATA 720 

CCGAGACTGA GGACCGGCTC TTCAAACACC TCTTCCGGGG CTACAACCGC TGGGCGCGCC 780 

CGGTGCCCAA CACTTCAGAC GTGGTGATTG TGCGCTTTGG ACTGTCCATC GCTCAGCTCA 840 

TCGATGTGGA TGAGAAGAAC CAAATGATGA CCACCAACGT CTGGCTAAAA CAGGAGTGGA 900 

GCGACTACAA ACTGCGCTGG AACCCCGCTG ATTTTGGCAA CATCACATCT CTCAGGGTCC 960 

CTTCTGAGAT GATCTGGATC CCCGACATTG TTCTCTACAA CAATGCAGAT GGGGAGTTTG 1020 

CAGTGACCCA CATGACCAAG GCCCACCTCT TCTCCACGGG CACTGTGCAC TGGGTGCCCC 1080 

CGGCCATCTA CAAGAGCTCC TGCAGCATCG ACGTCACCTT CTTCCCCTTC GACCAGCAGA 1140 

ACTGCAAGAT GAAGTTTGGC TCCTGGACTT ATGACAAGGC CAAGATCGAC CTGGAGCAGA 1200 

TGGAGCAGAC TGTGGACCTG AAGGACTACT GGGAGAGCGG CGAGTGGGCC ATCGTCAATG 1260 

CCACGGGCAC CTACAACAGC AAGAAGTACG ACTGCTGCGC CGAGATCTAC CCCGACGTCA 1320 

CCTACGCCTT CGTCATCCGG CGGCTGCCGC TCTTCTACAC CATCAACCTC ATCATCCCCT 1380 

GCCTGCTCAT CTCCTGCCTC ACTGTGCTGG TCTTCTACCT GCCCTCCGAC TGCGGCGAGA 1440 

AGATCACGCT GTGCATTTCG GTGCTGCTGT CACTCACCGT CTTCCTGCTG CTCATCACTG 1500 

C CGCTCATCGG CGAGTACCTG CTGTTCACCA 1560 
1620 

1740 




1860 
1920 

AGGAGGGTGA GCTGCTGCTA TCACCCCACA TGCAGAAGGC ACTGGAAGGT GTGCACTACA 1980 
TTGCCGACCA CCTGCGGTCT GAGGATGCTG ACTCTTCGGT GAAGGAGGAC TGGAAGTATG 2040 
TTGCCATGGT CATCGACAGG ATCTTCCTCT GGCTGTTTAT C&TCGTCTGC TTCCTGGGGA 2100 

2160 
2220 

G AACAGCCAGG TGAGGTGGGA GGTTGGAGAG 2280 
G TCTCTCTAAG TCAGGCTGGG GTTGAAGTTT GGAGTCTGTC CGAGTTTGCA 2340 
GGGTGCTGAG CTGTATGGTC CAGCAGGGGA GTAATAAGGG CTCTTCCGGA AGGGGAGGAA 2400 
GCGGGAGGCA GGCCTGCACC TGATGTGGAG GTACAGGCAG ATCTTCCCTA CCGGGGAGGG 2460 
ATGGATGGTT GGATACAGGT GGCTGGGCTA TTCCATCCAT CTGGAAGCAC ATTTGAGCCT 2520 
CCAGGCTTCT CCTTGACGTC ATTCGTCTCC TTCCTTGCTG CAAAATGGCT CTGCACCAGC 2580 
CGGCCCCCAG GAGGTC TGGC AGAGCTGAGA GCCATGGCCT GCAGGGGCTC CATATGTCCC 2640 
TACGCGTGCA GCAGGCAAAC AAGA 



5U Protein Accession*: NP_O0O733 

1 11 21 31 41 51 

I I I I I I 

MGPSCPVFLS FTKLSLWWLL LTPAGGEEAK RPPPRAPGDP LSSPSPTALP QGGSHTETED 

55 RLFKHLFRGY HRWARFVPNT SDWIVRFGL SIAQLIDVDE KNOMMTTNW LKQEWSDYKL 
RWNPADFGNI TSLRVPSEM1 WIPDIVLYNH ADGEFAVTHM TKAHLFSTGT VHWVPPA1YK 
SSCSIDVTFF PFDQQNCKMK FGSWTYDKAK IDLEQMEQTV DLKDYWESGE WAIVNATGTY 
NSKKYDCCAE IYPDVTYAEV IRRLFLFYTI NLIIPCLL1S CLTVLVFYLP SDCGEKITLC 
IEVLLSLTVF LLLITEIIPS TSLVIPLIGE YLLFTMIFVT LSIVITVFVL NVHHRSPSTH 

OU TMPHWVRGAL LGCVPRWLLM NRPPPPVELC HPLRLKLSPS YHWLESNVDA EEREVWEEE 
DRWACAGHVA PSVGTLCSHG HLHSGASGPK AEALLOEGEL LLSPHMQKAL EGVHYIADHL 
RSEDADSSVK EDWKYVAMVI DR1FLWLFII VCFLGTIGLF LPPFLAGMI 

,_ SEQ ID NO:103 PEU4 DNA SEQUENCE 

65 Nucleic Acid Accession*: NMJU8670 

Coding sequence: 87-893 (underlined sequences correspond to start and stop codons) 

l 11 21 31 41 51 

_ n I I I I I I 

/U CACGAGGCTG GAAGGGGCCA CTTCACACCT CGGGCTCGGC ATAAAGCGGC CGCCGGCCGC 

CGGCCCCCAG ACGCGCCGCC GCTGCCATGG CCCAGCCCCT GtGCCCGCCG CTCTCCGAGT 
CCTGGATGCT CTCTGCGGCC TGGGGCCCAA CTCGGCGGCC GCCGCCCTCC GACAAGGACT 
GCGGCCGCTC CCTCGTCTCG TCCCCAGACT CATGGGGCAG CACCCCAGCC GACAGCCCCG 
_ _ TGGCGAGCCC CGCGCGGCCA GGCACCCTCC GGGACCCCCG CGCCCCCTCC GTAGGTAGGC 

75 GCGGCGCGCG CAGCAGCCGC CTGGGCAGCG GGCAGAGGCA GAGCGCCAGT GAGCGGGAGA 

AACTGCGCAT GCGCACGCTG GCCCGCGCCC TGCACGAGCT GCGCCGCTTT CTACCGCCGT 
CCGTGGCGCC CGCGGGCCAG AGCCTGACCA AGAICGAGAC GCTGCGCCTG GCTATCCGCT 
ATATCGGCCA CCTGTCGGCC GTGCTAGGCC TCAGCGAGGA GAGTCTCCAG CGCCGGTGCC 
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TATCCGCCGT CCGCGCCGGG GCGTCCTGGG GATCCCCGCC TGCCTGCCCC GGAGCCCGAG 720 

CTGCACCCGA GCCGCGCGAC CCGCCTGCGC TGTTCGCCGA GGCGGCGTGC CCGGAAGGGC 780 

AGGCGATGGA GCCAAGCCCA CCGTCCCCGC TCCTTCCGGG CGACGTGCTG GCTCTGTTGG 840 

AGACCTGGAT GCCCCTCTCG CCTCTGGAGT GGCTGCCTGA GGAGCCCAAG TgACAAGGGA 900 

CAACTGACGC CGTCTCTGTG AGCACCGAGG CTTTTTGGCC TCAGCACCTT CGAAGTGGTT 960 

CCTTGGCAGA CTGCCTTTCC TGGAAGAGGG CACGG GCG AT GCCGACGGGG GCATTCCTGC 1020 

GGGTGAGAGC CGTCCCCACC GCGGCGGCCC TTCTCAGCCC CTCCCTCCAT GGAGGGACCC 1080 

ATAGGGCTAG ACACTTTGAG GCAAGCAGGA GGCTCTGCCT AATGTGAATT TATTTATTTG 1140 

TGAATAAACT GTACTGGTGT CAAAAAAAAA AAAAAAAAAA A 

SEq ID NQ:iq4PEU4 Pr ote in se q uw 



MAQPLCPPLS ESWMLSAAKG PTRRPPPSDK DCGRSLVSSP DSWGSTPADS PVASPARPGT 
LRDPRAPEVG RRGARSSRLG SGQRQSASER EKLRMRTLAR ALHELRRFLP PSVAPAGQSL 
TKIETLRLAI RYIGHLSAVL GLSEESLQRR CRQRGDAGSP RGCPLCPDDC PAOMQTRTOA 
EGQGQGRGLG LVSAVRAGAS WGSPPACPGA RAAPEPRDPP ALFAEAACPE GQAHEPEPPS 
LETWMPLSPL EWLPEEPK 

SEQIDNO:105 PEUSDNA SEQUENCE 

#: NM_0 17636 

324-3374 (underlined sequences correspond to start and stop codons) 




G GAGTCCTACA TCTCACAGCA GAAGACGGGC GTGGGAGGGA 
CTGGAATTGA CATCCCTGTC CTGCTCCTCC TGATTGATGG TGATGAGAAG ATGTTGACGC 
GAATAGAGAA CGCCACCCAG GCTCAGCTCC CATGTCTCCT CGTGGCTGGC TCAGGGGGAG 
CTGCGGACTG CCTGGCGGAG ACCCTGGAAG ACACTCTGGC CCCAGGGAGT GGGGGAGCCA 

TGCAGGCCCA GGTGGAGAGG ATTATGACCC GGAAGGAGCT CCTGACAGTC TATTCTTCTG 
AGGATGGGTC TGAGGAATTC GAGACCATAG TTTTGAAGGC CCTTGTGAAG GCCTGTGGGA 
GCTCGGAGGC CTCAGCCTAC CTGGATGAGC TGCGTTTGGC TGTGGCTTGG AACCGCGTGG 
ACATTGCCCA GAGTGAACTC TTTCGGGGGG ACATCCAATG GCGGTCCTTC CATCTCGAAG 
CTTCCCTCAT GGACGCCCTG CTGAATGACC SGCCTGAGTT CGTGCGCTTG CTCATTTCCC 
ACGGCCTCAG CCTGGGCCAC TTCCTGACCC CGATGCGCCT GGCCCAACTC TACAGCGCGG 
CGCCCTCCAA CTCGCTCATC CGCAACCTTT TGGACCAGGC GTCCCACAGC GCAGGCACOA 
AAGCCCCAGC CCTAAAAGGG GGAGCTGCGG AGCTCCGGCC CCCTGACGTG GGGCATGTGC 
TGAGGATGCT GCTGGGGAAG ATGTGCGCGC CGAGGTACCC CTCCGGGGGC GCCTGGGACC 
CTCACCCAGG CCAGGGCTTC GGGGAGAGCA TGTATCTGCT CTCGGACAAG GCCACCTCGC 
CGCICTCGCT GGATGCTGGC CTCGGGCAGG CCCCCTGGAG CGACCTGCTT CTTTGGGCAC 
TGTTGCTGAA CAGGGCACAG ATGGCCATGT ACTTCTGGGA GATGGGTTCC AATGCAGTTT 
CCTCAGCTCT TGGGGCCTGT TTGCTGCTCC GGGTGATGGC ACGCCTGGAG CCTGACGCTG 
AGGAGGCAGC ACGGAGGAAA GACCTGGCGT TCAAGTTTGA GGGGATGGGC GTTGACCTCT 
C CGTCGCTGCC 
C CGTGCCTTCT 1800 
.860 

CTACACCCAT CTCGGCCCTG GTTCTCGCCT TCTTTTGCCC TCCACTCATC TACACCCGCC 1920 
TCATCACCTT CAGGAAATCA GAAGAGGAGC CCACACGGGA GGAGCTAGAG TTTGACATGG 
ATAGTGTCAT TAATGGGGAA GGGCCTGTCG GGACGGCGGA CCCAGCCGAG AAGACGCCGC 2040 
TGGGGGTCCC GCGCCAGTCG GGCCGTCCGG GTTGCTGCGG GGGCCGCTGC GGGGGGCGCC 2100 
G CCGCTGGTTC CACTTCTGGG GCGCGCCGGT GACCATCTTC ATGGGCAACG 2160 
T TCTCGCGGGT GCTGCTCGTG GATTTCCAGC 2220 
G GAGCTGCTGC TCTATTTCTG GGCTTTCACG CTGCTGTGCG 2280 
AGGAACTGCG CCAGGGCCTG AGCGGAGGCG GGGGCAGCCT CGCCAGCGGG GGCCCCGGGC 2340 
CTGGCCATGC CTCACTGAGC CAGCGCCTGC GCCTCTACCT CGCCGACAGC TGGAACCAGT 2400 
GCGACCTAGT GGCTCTCACC TGCTTCCTCC TGGGCGTGGG CTGCCGGCTC ACCCCGGGTT 2460 
T GGGCCGCACT GTCCTCTGCA TCGACTTCAT GGTTTTCACG GTGCGGCTGC 2520 
T CACGGTCAAC AAACAGCTGG GGCCCAAGAT CGTCATCGTG AGCAAGATGA 2580 
C CTCTTCTTCC TCGGCGTGTG GCTGGTAGCC TATGGCGTGG 2640 
CCACGGAGGG GCTCCTGAGG CCACGGGACA GTGACTTCCC AAGTATCCTG CGCCGCGTCT 2700 
TCTACCGTCC CTACCTGCAG ATCTTCGGGC AGATTCCCCA GGAGGACATG GACGTGGCCC 2760 
TCATGGAGCA CAGCMCTGC TCGTCGGAGC CCGGCTTCTG GGCACACCCT CCTGGGGCCC 2620 
AGGCGGGCAC CTGCGTCTCC CAGTATGCCA ACTGGCTGGT GGTGCTGCTC CTCGTCATCT 2880 
TCCTGCTCGT GGCCMCATC CTGCTGGTCA ACTTGCTCAT TGCCATGTTC AGTTACACAT 2940 
TCGGCAAAGT ACAGGGCAAC AGCGATCTCT ACTGGAAGGC GCAGCGTTAC CGCCTCATCC 3000 
GGGAATTCCA CTCTCGGCCC GCGCTGGCCC CGCCCTTTAT CGTCATCTCC CACTTGCGCC 3060 
TCCTGCTCAG GCAATTGTGC AGGCGACCCC GGAGCCCCCA GCCGTCCTCC CCGGCCCTCG 3120 
AGCATTTCCG GGTTTACCTT TCTAAGGAAG CCGAGCGGAA GCTGCTAACG TGGGAATCGG 3180 
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45 
50 

55 
60 
65 
70 
75 



TGCATAAGGA GAACTTTCTG CTGGCACGCG CTAGGGACAA GCGGGAGAGC GACTCCGAGC 3240 

GTCTGGAGCG CACGTCCCAG AAGGTGGACT TGGCACTGAA ACAGCTGGGA CACATCCGCG 3300 

AGTACGAACA GCGCCTGAAA GTGCTGGAGC GGGAGGTCCA GCAGTGTAGC CGCGTCCTGG 33S0 

GGTGGGTGAC GTAGGCCGTT AGCAGCTCTG CCATGTTGCC CTCAGGTGGG CCGCCACCCC 3420 

TTGACCTGCA TGGGTCCAAA GAGTGAGCCA TGCTGGCGGA TTTTAAGGAG AAGCCCCCAC 3480 

AGGGGATTTT GCTCTTAGAG TAAGGCTCAT GTGGGCCTCG GCCCCCGCAC CTGGTGGCCT 3540 

TGTCCTTGAG GTGAGCCCCA TGTCCATCTG GGCCACTGTC AGGACCACCT TTGGGAGTGT 3600 

CATCCTTACA AACCACAGCA TGCCCGGCTC CTCCCAGAAC CAGTCCCAGC CTGGGAGGAT 3660 

CAAGGCCTGG ATCCCGGGCC GTTATCCATC TGGAGGCTGC AGGGTCCTTG GGGTAACAGG 3720 



1 11 21 31 41 51 

I I I I I I 

MASTGGTKW AMGVAPWGW RNRDTLINPK GSFPARYRWR GDPEDGVQFP LDYNYSAFFL 

VDDGTHGCLG GENRFRLRLE SYISQQKTGV GGTGIDIPVL LLLIDGDEKM LTRIENATQA 
QLPCLLVAGS GGAADCLAET LEDTLAPGSG GARQGEARDR IRRFFPKGDL EVLQAQVERI 

WTRKELLTVY SSEDGSEEFE TIVLKALVKA CGSSEASAYL DELRLAVAWN RVDIAQSEIiF I 

RGDIOWRSFH LEASLMDALL NDRPEFVRLL ISHGLSLGHF LTPMRLAQLY SAAPSNSLIR : 

NLLDQASHSA GTKAPALKGG AAELRPPDVG HVLRMLLGKM CAPRYPSGGA WDPHPGQGFG : 

ESMYLLSDKA TSPLSLDAGL GQAPWSDLLL WALLLNRAQK AMYFWEMGSN AVSSALGACL 

LLRVMARLEP DAEEAARRKD LAFKFEGMGV DLFGECYRSS EVRAARLLLR RCPLWGDATC 

LQLAMQADAR AFFAQDGVQS LLTQKWWGDM ASTTPIWAW LAFFCPPLIY TRLITFRKSE ! 

EE PTREELEF DMDSVINGEG PVGTADPAEK TPLGVPRQSG RPGCCGGRCG GRRC LRRWFH I 

FWGAPVTIFM GNWSYLLFL LLFSRVLLVD FQPAPPGSLE LLLYFWAFTL LCEELRQGLS I 

GGGGSLASGG PGPGHASLSQ RLRLYLADSW NQCDLVALTC FLLGVGCRLT PGLYHLGRTV 

LC1DFMVFTV RLLHIFTVNK QLGPKIVIVS KMMKDVFFFL FFLGVWLVAY GVATEGLLRP 

RDSDFPSILR RVFYRPYLQI FGQIPQEDMD VALMEHSNCS SEPGFWAHPP GAQAGTCVSQ I 

YANWLWLLL VIFLLVANIL LVNLLIAMFS YTPGKVQGNS DLYWKAQRYR LIREFHSRPA ! 

LAPPFIVIEH LRLLLRQLCR RPRSPQPSSP ALEHFRVYLS KEAERKLLTW ESVHKENFLL ! 

ARARDKRESD SERLERTSQK VDLALKQLGH IREYEQRLKV LEREVQQCSR VLGWVT 

SEQ ID NO.-107 PEW3 ONA SEQUENCE 

Nucleic Acid Accession #: NMJM5982 

Coding sequence: 276-1 130 (underlined sequences correspond to start and slop codons) 



GGTAGCAGCA TCCACCGGGC GGGAGGTCGG AGGCAGCAAG GCCTTAAAGG CTACTGAGTG 
CGCCGGCCGT TCCGTGTCCA GAACCTCCCC TACTCCTCCG CCTTCTCTTC CTTGGCCGCC 
CACCGCCAAG TTCCGAC TCC GGTTTTCGCC TTTGCAAAGC CTAAGGAGGA GGTTAGGAAC 
C CCCTCCCTGC GGCCGCCGCC CCCTGCCICT CGGCTCTGCT O 

A GCAAGTGGCG TGCGTGTGCG AGGTTCTGCA GCAAGGCGGA A 

GCCTGGGCAG GTTCCTGTGG TCACTGCCCG CCTGCGACCA CCTGCACAAG AACGAGAGCG 420 

TACTCAAGGC CAAGGCGGTG GTCGCCTTCC ACCGCGGCAA CTTCCGTGAG CTCTACAAGA 480 

TCCTGGAGAG CCACCAGTTC TCGCCTCACA ACCACCCCAA ACTGCAGCAA CTGTGGCTGA 540 

AGGCGCATTA CGTGGAGGCC GAGAAGCTGC GCGGCCGACC CCTGGGCGCC GTGGGCAAAT 600 

ATCGGGTGCG CCGAAAATTT CCACTGCCGC GCACCATCTG GGACGGCGAG GAGACCAGCT 660 

ACTGCTTCAA GGAGAAGTCG AGGGGTGTCC TGCGGGAGTG GTACGCGCAC AATCCCTACC 720 

CATCGCCGCG TGAGAAGCGG GAGCTGGCCG AGGCCACCGG CCTCACCACC ACCCAGGTCA 780 

GCAACTGGTT TAAGAACCGG AGGCAAAGAG ACCGGGCCGC GGAGGCCAAG GAAAGGGAGA 840 

V CAATAACTCC TCCTCCAACA AGCAGAACCA ACTCTCTCCT CTGGAAGGGG 900 

P CATGTCCAGC TCAGAAGAGG AATTCTCACC TCCCCAAAGT CCAGACCAGA 960 

ACTCGGTCCT TCTGCTGCAG GGCAATATGG GCCACGCCAG GAGCTCAAAC TATTCTCTCC 1020 

CGGGCTTAAC AGCCTCGCAG CCCAGTCACG GCCTGCAGAC CCACCAGCAT CAGCTCCAAG 1080 

ACTCTCTGCT CGGCCCCCTC ACCTCCAGTC TGGTGGACTT GGGGTCCTAA GTGGGGAGGG 1140 

ACTGGGGCCT CGAAGGGATT CCTGGAGCAG CAACCACTGC AGCGACTAGG GACACTTGTA 1200 



L GRFLWSLPAC DHLHKNESVL KAKAWAFHR 



IWDGEETSYC FKEKSRGVLR EWYAHNPYPS PREKRELAEA TGLTTTQVSN WFKNRRQRDR 
AAEAKERENT ENHNSSSNKQ NQLSPLEGGK PLMSSSEEEF SPPQSPDQNS VLLLQGNMGH 
ARSSNYSLPG LTASQPSHGL QTHQHQLQDS LLGPLTSSLV DLGS 

SEQ ID NO:109 PFJ8 DNA SEQUENCE 

Nucleic Acid Accession*: NM.005069 



WO 02/30268 



PCT/US01/32045 



1 11 21 31 41 51 
I I I I I I 

GGGGCTCCGC GGGCCTGGAG CACGGCCGGG TCTAATATGC CCGGAGCCGA GGCGCGATQA 60 
AGGAGAAGTC CAAGAATGCG GCCAAGACCA GGAGGG AGAA GGAAAATGGC GAGTTTTACG 110 
5 AGCTTGCCAA GCTGCTCCCG CTGCCGTCGG CCATCACTTC GCAGCTGGAC AAAGCGTCCA 180 
TCATCCGCCT CACCACGAGC TACCTGAAGA TGCGCGCCGT CTTCCCCGAA GGTTTAGGAG 240 
ACGCGTGGGG ACAGCCGAGC CGCGCCGGGC CCCTGGACGG CGTCGCCAAG GAGCTGGGAT 300 
CGCACTTGCT GCAGACTTTG GATGGATTTG TTTTTGTGGT AGCATCTGAT GGCAAAATCA 360 
TGTATATATC CGAGACCGCT TCTGTCCATT TAGGCTTATC CCAGGTGGAG CTCACGGGCA 420 

10 ACAGTATTTA TGAATACATC CATCCTTCTG ACCACGATGA GATGACCGCT GTCCTCACGG 480 
CCCACCAGCC GCTGCACCAC CACCTGCTCC AAGAGTATGA GATAGAGAGG TCGTTCTTTC 540 
TTCGAATGAA ATGTGTCTTG GCGAAA AGGA ACGCGGGCCT GACCTGCAGC GGATAC A AGG 600 
TCATCCACTG CAGTGGCTAC TTGAAGATCA GGCAGTATAT GCTGGACATG TCCCTGTACG 660 
ACTCCTGCTA CCAGATTGTG GGGCTGGTGG CCGTGGGCCA GTCGCTGCCA CCCAGTGCCA 720 

15 TCACCGAGAT CAAGCTGTAC AGTAAC ATGT TCATGTTCAG GGCCAGCCTT GACCTGAAGC 780 
TGATATTCCT GGATTCCAGG GTGACCGAGG TGACGGGTTA CGAGCCGCAG GACCTGATCG 840 
AGAAGACCCT ATACCATCAC GTGCACGGCT GCGACGTGTT CCACCTCCGC TACGCACACC 900 
ACCTCCTGTT GGTGAAGGGC CAGGTCACCA CCAAGTACTA CCGGCTGCTG TCCAAGCGGG 960 
GCGGCTGGGT GTGGGTGCAG AGCTACGCCA CCGTGGTGCA CAACAGCCGC TCGTCCCGGC 1020 

20 CCCACTGCAT CGTGAGTGTC AATTATGTAC TCACGGAGAT TGAATACAAG GAACTTCAGC 1080 
TGTCCCTGGA GCAGGTGTCC ACTGCCAAGT CCCAGGACTC CTGGAGGACC GCCTTGTCTA 1 140 
CCTCACAAGA AACTAGGAAA TTAGTGAAAC CCAAAAATAC CAAGATGAAG ACAAAGCTGA 1200 
GAACAAACCC TTACCCCCCA CAGCAATACA GCTCGTTCCA AATGGACAAA CTGGAATGCG 1260 
GCCAGCTCGG AAACTGG AGA GCCAGTCCCC CTGCAAGCGC TGCTGCTCCT CCAGAACTGC 1320 

25 AGCCCCACTC AGAAAGCAGT GACCTTCTGT ACACGCCATC CTACAGCCTG CCCTTCTCCT 1380 
ACCATTACGG ACACTTCCCT CTGGACTCTC ACGTCTTCAG CAGCAAAAAG CCAATGTTGC 1440 
CGGCCAAGTT CGGGCAGCCC CAAGGATCCC CTTGTGAGGT GGCACGCTTT TTCCTGAGCA 1500 
CACTGCCAGC CAGCGGTGAA TGCCAGTGGC ATTATGCCAA CCCCCTAGTG CCIAGCAGCT 1560 

_ . CGTCTCCAGC TAA AAATCCT CCAGAGCCAC CGGCGAAC AC TGCTAGGCAC AGCCTGGTGC 1620 

30 CAAGCTACGA AGCGCCCGCC GCCGCCGTGC GCAGGTTCGG CGAGGACACC GCGCCCCCGA 1680 
GCTTCCCGAG CTGCGGCCAC TACCGCGAGG AGCCCGCGCT GGGCCCGGCC AAAGCCGCCC 1740 
GCCAGGCCGC CCGGGACGGG GCGCGGCTGG CGCTGGCCCG CGCGGCACCC GAGTGCTGCG 1800 
CGCCCCCGAC CCCCGAGGCC CCGGGCGCGC CGGCGCAGCT GCCCTTCGTG CTGCTCAACT I860 

. _ ACCACCGCGT GCTGGCCCGG CGCGGACCGC TGGGGGGCGC CGCACCCGCC GCCTCCGGCC 1920 

35 TGGCCTGCGC TCCCGGCGGC CCCGAGGCGG CGACCGGCGC GCTGCGGCTC CGGCACCCGA 1980 
GCCCCGCCGC CACCTCCCCG CCCGGCGCGC CCCTGCCGCA CTACCTGGGC GCCTCGGTCA 2040 
TCATCACCAA CGGGAGG1GA CCCGCTGGCC GCCCGCGCCA GGAGCCTGGA CCCGGCCTCC 2100 
CGGGGCTGCG GCGCCACCGA GCCCGGCAAA TGCGCACGAC CTACATTAAT TTATGCAGAG 2160 
. . ACAGCTGTTT GAATTGGACC CCGCCGCCGA CTTGCGG ATT TCCACCGCGG AGGCCCCGCG 2220 

40 CGCCGGTGCC GAGGGCCGAG GAGCGCCCGG GTCCGGGCAG GTGACCGCCC GCCTCTGTCC 2280 
TGCGAGGGCC GGTGCGACCC AGTTGCTGGG GGCTTGGTTT CCTCACCTTG AAATCGGGCT 2340 
TCACGCGTCT TGCCTTGTCC CCAACGTTCC ACAACAGTCC CGCTGGGGGA TTGAAGCGGT 2400 
TTCACTCCGC AAATATCCTC CACTTTCAGG AGGGAAAACC CACCCTACCA CAGTCCGCTC 2460 
TTCCAAGTGG ACGGCAGACC TGGGAGGGGA CGCCTGTGTC ACGAGCCCTT TTAGATGCTT 2520 

45 AGGTGAAGGC AGAAGTGATG ATTGTAAGTC CCATGAATAC ACAACTCCAC TGTCTTTAAA 2580 
AGTCATTCAA GAGTCTCATT ATTTTTGTTT TTATTTAACC CTTTCTTCAA TACAAAAAGC 2640 
CAACAAACCA AGACTAAGGG GGTGACCATG CAATTCCATT TTGTGTCTGT GAACATAGGT 2700 
GTGCTTCCCA AATACATTAA CAAGCTCTTA CTTCCCCCTA ACCCCTATGA ACTCTTGATA 2760 

_ _ ACACCAAGAG TAGCACCTTC AGAATATATT GAATAGGCAT TAAATGCAAA AATATATATG 2820 

50 TAGCCAGACA GTTTATGAGA ATGACCCTGT CAAGCTTCAT TATTACGTGG CAAAATCCCT 2880 
CTGGCCCACA CAGATCTGTA ATTCACTAGG CTCGTGTTTG CTACAAATAG TGCTAATAAA 2940 
GTTAAATTGC ACGTGCAATA CGGAACACTG TCAATGGACT GCACCTTGTG AAGGAAAAAC 3000 
ATGCTTAAGG GGGTGTAATG AAAATGATGT AGACATTTTA AGCATTTTCT ACACAGCG AG 3060 
AAAACTTCGT AAGAACATGT TACGTGTGCA ACAGGTAAAC AGAAATCCTT TCATAAAGCA 3120 

55 CCAGCAGTGT TTAAAAAATG AGCTTCCATT AATTTTTACT TTTTATGGGT TTTGCTTAAA 3180 
GATCTCAACA TGGAAAAATC CTGTCATGGC TCTGAACTGC ACAATGCATT GAACCGCCGT 3240 
CCTTCAATTT TCTTCACACT ATCAACACTG CAGCATTTTG CTGCTTTATC AAAATGGTTT 3300 
ATTTTAGGAA ACTTTTTCCA CCTTTCTGAA TGGAAAGAGG TTTTCACAAA TGTTTTAAAC 3360 

, _ TCATCGTTCT AAAATCAAGT GCACCTACAC CAACTGCTCT CAAAATGTGA ACTGACTTTT 3420 

60 TTnTTTTTT TTTTGCCAAC CCTGTGTCAC TTAGTGAGGA CCTGACACAA TCCCTACAGG 3480 
GTGTCTGTCA GTGGGCCTCA TGGTAAGAGT CACAATTTGC AAATTTAGGA CCGTGGGTCA 3540 
TGCAGCGAAG GGGCTGGATG GTAGGAAGGG ATGTGCCCGC CTCTCCACGC ACTCAGCTAT 3600 
ACCTCATTCA CAGCTCCTTG TGAGTGTGTG CACAGGAAAT AAGCCGAGGG TATTATTTTT 3660 
TTATGTTCAT GAGTCTTGTA ATTA AACCGT G ATTCTTG AA AGGTGTAGGT TTGATTACTA 3720 

65 GGAGATACCA CCGACATTTT TCAATAAAGT ACTGCAAAAT GCTTTTGTGT CTACCTTGTT 3780 
ATTAACTTTT GGGGCTGTAT TTAGTAAAAA TAAATCAAGG CTATCGGAGC AGTTCAATAA 3840 
CAAAGGTTAC TGTTGAGAAA AAAGACCCTA TCATAGATTT ACAAG 



70 

SEQ ID N0:110 PFJB Protein sequence: 
Protein Accession* NP_005060.1 



MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIKLTTSYL KMRAVFPEGL 60 
GDAWGQPSRA GPLDGVAKEL GSHLLQTUX3 FVFWASDGK IMYISETASV HLGLSQVELT 120 
GNSIYEYIHP SDHDEMTAVL TAHQPLHHHL LQEYE1ERSF FLRMKCVLAK RNAGLTCSGY 1 80 
KVIHCSGYLK IRQYMLDMSL YDSCYQIVGL VAVGQSLPPS AITEIKLYSN MFMFRASLDL 240 



341 



WO 02/30268 



KLIFLDSRVT EVTGYEPQDL EKTLYHHVH GCDVFHLRYA HHLLLVKGQV TTKYYRLLSK 300 
RGOWVWVQSY ATWHNSRSS RPHCIVSVNY VLTEEYKEL QLSLEQVSTA KSQDSWRTAL 360 
STSQETRKLV KPKNTKMKTK LRTNPYPPQQ YSSFQMDKLE CGQLGNWRAS PPAS AAAPPE 420 
LQPHSESSDL LYTPSYSLPF SYHYGHFPLD SHVFSSKKPM LPAKFGQPQG SPCEVARFFL 480 
STXPASGECQ WHYANPLVPS SSSPAKNPPE PPANTARHSL VPSYEAPAAA VRRFGEDTAP 540 
PSFPSCGHYR EEPALGPAKA ARQAARDGAR LALARAAPEC CAPPTPEAPG APAQLPFVLL 600 
NYHRVLARRG PLGGAAPAAS GLACAPGGPE AATGALRLRH PSPAATSPPG APLPHYLGAS 660 
VHTNGR 



SEQ ID N0:111 PFJ7 DNA SEQUENCE 

Nucleic Acid Accession*: NM_005549 

Coding sequence: 1-1254 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATG AACGGAC GCTGCATCTG CCCGTCCCTG CCCTACTCAC CCGTCAGCTC CCCGCAGTCC 60 
TCGCCTCGGC TGCCCCGGCG GCCGACAGTG GAGTCTCACC ACGTCTCCAT CACGGGTATG 120 
CAGGACTGTG TGCAGCTGAA TCAGTATACC CTGAAGGATG AAATTGGAAA GGGCTCCTAT 180 
GGTGTCGTCA AGTTGGCCTA CAATGAAAAT GACAATACCT ACTATGCAAT GAAGGTGCTG 240 
TCCAAAAAGA AGCTGATCCG GCAGGCCGGC TTTCCACGTC GCCCTCCACC CCGAGGCACC 300 
CGGCCAGCTC CTGGAGGCTG CATCCAGCCC AGGGGCCCCA TTGAGCAGGT GTACCAGGAA 360 
ATTGCCATCC TCAAGAAGCT GGACCACCCC AATGTGGTGA AGCTGGTGGA GGTCCTGGAT 420 
GACCCCAATG AGGACCATCT GTACATGGTG TTCGAACTGG TCAACCAAGG GCCCGTGATG 480 
GAAGTGCCCA CCCTCAAACC ACTCTCTGAA GACCAGGCCC GTTTCTACTT CCAGGATCTG 540 
ATCAAAGGCA TCGAGTACTT ACACTACCAG AAGATCATCC ACCGTGACAT CAAACCTTCC 600 
AACCTCCTGG TCGGAGAAGA TGGGCACATC AAGATCGCTG ACTTTGGTGT GAGCAATGAA 660 
TTCAAGGGCA GTGACGCGCT CCTCTCCAAC ACCGTGGGCA CGCCCGCCTT CATGGCACCC 720 
GAGTCGCTCT CTGAGACCCG CAAGATCTTC TCTGGGAAGG CCTTGGATGT TTGGGCCATG 780 
GGTGTGACAC TATACTGCTT TGTCTTTGGC CAGTGCCCAT TCATGGACGA GCGGATCATG 840 
TGTTTACACA GTAAGATCAA GAGTCAGGCC CTGGAATTTC CAGACCAGCC CGACATAGCT 900 
GAGGACTTGA AGGACCTGAT CACCCGTATG CTGGACAAGA ACCCCGAGTC GAGGATCGTG 960 
GTGCCGGAAA TCAAGCTGCA CCCCTGGGTC ACGAGGCATG GGGCGGAGCC GTTGCCGTCG 1020 
GAGGATGAGA ACTGCACGCT GGTCG AAGTG ACTGAAGAGG AGGTCGAGAA CTCAGTCAAA 1080 
CACATTCCCA GCTTGGCAAC CGTGATCCTG GTGAAGACCA TGATACGTAA ACGCTCCTTT 1140 
GGGAACCCAT TCGAGGGCAG CCGGCGGGAG GAACGCTCAC TGTCAGCGCC TGGAAACTTG 1200 
CTCACCAAAA AACCAACCAG GGAATGTGAG TCCCTGTCTG AGCTCAAGAC CTAGAAAATA 1260 
AGTCCCCTTC CTGCCTGTTG CAAAGTAACG TAAGAGTTCC CTCACCCGAG TGGATGCAGA 1320 
CGTTCTTGCT GTCAGCCACC TTCCTTCATA CACATAGCCA GCCCAGGGTG ACCAGAACGT 1380 
CCCAGGACAG ATGAGGCTTT GTGTCCTTAT GAGAGTGGGA GAACCTGGTG GGCACCCCTG 1440 
GTGCAGGTGC TGTGGTGGGT GGGGACCCCA CTGCCTTTCC CACTGAGCAC ATCATGGCTA 1500 
CCTGACTTGG TGGGAGTTCC ATTCAGTCAC TTCTGTTTCT TAAACATAGC TTTACTGAGG 1560 
TACAATTCAC ATACCATGTA ATTCACCCAC GGGAAGTGTA TGATTCAGTG GTTTCTAATA 1620 
CACACTTCTG CAGCCATTAC CACCGTCAAC TTTACGACAT TTTCATCAGC CCAAGAAGAC 1680 
ACCCTACACT CCTTAGCTGT CCCCATCCAA CTCCCCCACC CCAGTAACCA CTCAGAATAG 1740 
GTATGGATTT GCCTATTCTG GACGTTTCGT ATAAATGGCG TCATACACTA AAAAAAAAAA 1800 
AAAA 



SEP ID N0:112 PFJ7 Protein seouence: 
Protein Accession*: NP.006540.1 

1 11 21 31 41 51 
I I I ] I I 

MNGRQCPSL PYSPVSSPQS SPRLPRRPTV ESHHVS1TGM QDCVQLNQYT LKDEIGKGSY 60 
GVVKLAYNEN DNTYYAMKVL SKKKLIRQAG FPRRPPPRGT RPAPGGCIQP RGPIEQVYQE 120 
IAE.KKLDHP NVVKLVEVLD DPNEDHLYMV FELVNQGPVM EVPTLKPLSE DQARFYFQDL 180 
KGIEYLHYQ K1IHRDIKPS NLLVGEDGHI KIADFGVSNE FKGSDALLSN TVGTPAFMAP 240 
ESLSETRKIF SGBCALDVWAM GVTLYCFVFG QCPFMDEPJM CLHSKIKSQA LEFPDQPDIA 300 
EDLKDLITRM LDKNPESRIV VPEIKLHPWV TRHGAEPLPS EDENCTLVEV TEEEVENSVK 360 
HIPSLATVIL VKTMIRKRSF GNPFEGSRRE ERSLSAPGNL LTKKPIRECE SLSELKT 



SEQ ID N0:113 PFJ6 DNA SEQUENCE 

Nucleic Acid Accession*: NM_021810 

Coding sequence: t -429 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGAAACCTC TG ATATGGAC ATGGTCAGAT GTTGA AGGCC AG AGGCCGGC TCTGCTCATC 60 

TGCACAGCTG CAGCAGGACC CACGCAGGGA GTTAAGGGTT ATGGCAAGCC CTTTGAGCCA 120 

AGAAGTGTGA AAAACATACA CTCTACTCCT GCTTACCCAG ATGCCACAAT GCACAGACAA 180 

CTCCTGGCTC CGGTGGAAGG AAGGATGGCA GAGACATTGA ATCAGAAACT CCATGTTGCC 240 

AATGTGCTGG AAGATGAOCC CGGCTACCTA CCTCACGTCT ACAGCGAGGA AGGGGAGTGT 300 

GGAGGGGCCC CATCCCTCAG CTCTCTGGCC AGCTTGGAAC AGGAGTTGCA ACCTGATTTG 360 

342 
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CTGGACTCTT TGGGTTCAAA AGCGACTCCG TTTGAGGAAA TATATTCAGA GTCAGGTGTT 420 
CCTTCCTAA 



5 SEQ ID N0:114 PFJ6 Protein sequence: 
Protein Accession #; NP.068582.1 

1 11 21 31 41 51 

1n I I I I I I 

10 MKPLIWTWSD VEGQRPALLI CTAAAGPTQG VKGYGKPFEP RSVKNIHSTP AYPDATMHRQ 60 
LLAPVEGRMA ETLNQKLHVA NVLEDDPGYL PHVYSEEGEC GGAPSLSSLA SLEQELQPDL 120 
LDSLGSKATP FEEIYSESGV PS 



SEQ ID N0:115 PFJ5 DNA SEQUENCE 

Nucleic Acid Accessions: NMJW6361 

20 1 11 21 31 41 51 
I I I I I I 

CGAATGCAGG CGACTTGCGA GCTGGGAGCG ATTTAAAACG CTTTGGATTC CCCCGGCCTG 60 
GGTGGGGAGA GCGAGCTGGG TGCCCCCTAG ATTCCCCGCC CCCGCACCTC ATGAGCCGAC 120 
CCTCGGCTCC ATQGAGCCCG GCAATTATGC CACCTTGGAT GGAGCCAAGG ATATCGAAGG 180 
25 CTTGCTGGGA GCGGGAGGGG GGCGGAATCT GGTCGCCCAC TCCCCTCTGA CCAGCCACCC 240 
AGCGGCGCCT ACGCTGATGC CTGCTGTCAA CTATGCCCCC TTGGATCTGC CAGGCTCGGC 300 
GGAGCCGCCA AAGCAATGCC ACCCATGCCC TGGGGTGCCC CAGGGGACGT CCCCAGCTCC 360 
CGTGCCTTAT GGTTACTTTG GAGGCGGGTA CTACTCCTGC CGAGTGTCCC GGAGCTCGCT 420 
_ GAAACCCTGT GCCCAGGCAG CCACCCTGGC CGCGTACCCC GCGGAGACTC CCACGGCCGG 480 

30 GGAAG AGTAC CCCAGTCGCC CCACTG AGTT TGCCTTCTAT CCGGGATATC CGGGAACCTA 540 
CCACGCTATG GCCAGTTACC TGGACGTGTC TGTGGTGCAG ACTCTGGGTG CTCCTGGAGA 600 
ACCGCGACAT GACTCCCTGT TGCCTGTGGA CAGTTACCAG TCTTGGGCTC TCGCTGGTGG 660 
CTGGAACAGC CAGATGTGTT GCCAGGGAGA ACAGAACCCA CCAGGTCCCT TTTGGAAGGC 720 
_ AQCATTTGCA GACTCCAGCG GGCAGCACCC TCCTGACGCC TGCGCCTTTC GTCGCGGCCG 780 

35 CAAGAAACGC ATTCCGTACA GCAAGGGGCA GTTGCGGGAG CTGGAGCGGG AGTATGCGGC 840 
TAACAAGTTC ATCACCAAGG ACAAGAGGCG CAAGATCTCG GCAGCCACCA GCCTCTCGGA 900 
GCGCCAGATT ACCATCTGGT TTCAGAACCG CCGGGTCAAA GAGAAGAAGG TTCTCGCCAA 960 
GGTGAAGAAC AGCGCTACCC CTTAAGAGAT CTCCTTGCCT GGGTGGGAGG AGCGAAAGTG 1020 
GGGGTGTCCT GGGGAGACCA GAAACCTGCC AAGCCCAGGC TGGGGCCAAG GACTCTGCTG 1080 
40 AGAGGCCCCT AGAGACAACA CCCTTCCCAG GCCACTGGCT GCTGGACTGT TCCTCAGGAG 1 i40 
CGGCCTGGGT ACCCAGTATG TGCAGGGAGA CGGAACCCCA TGTGACAGGC CCACTCCACC 1200 
AGGGTTCCCA AAGAACCTGG CCCAGTCATA ATCATTCATC CTCACAGTGG CAATAATCAC 1260 
GATAACCAGT 

45 

SEQ ID N0:116 PFJ5 Protein sequence: 
Protein Accession If: NP_006352.1 

50 1 11 21 31 41 51 

MEPGNYATLD GAKDffiGLLG AGGGRNLVAH SPLTSHPAAP TLMPAVNYAP LDLPGS AEPP 60 
KQCHPCPGVP QGTSPAPVPY GYFGGGYYSC RVSRSSLKPC AQAATLAAYP AETPTAGEEY 120 
PSRPTEFAFY PGYPGTYHAM ASYLDVSVVQ TLGAPGEPRH DSLLPVDSYQ SWALAGGWNS 180 
55 QMCCQGEQNP PGPFVVKAAFA DSSGQHPPDA CAFRRGRKKR IPYSKGQLRE LEREYAANKF 240 
ITKDKRRKIS AATSLSERQI T1WFQNRRVK EKKVLAKVKN SATP 



60 SEQIDNO:117PFJ4DNASEQUENCE 

Nucleic Acid Accession #: NMJJ05628 

Coding sequence: 591-2216 (underlined sequences correspond to start and stop codons) 



GTAACCGCTA CTCCCGGACA CCAGACCACC GCCTTCCGTA CACAGGGGCC CGCATCCCAC 60 
CCTCCCGGAC CTAAGAGCCT GGGTCCCCTG TTTCCGGAGG TCCGCTTCCC GGCCCCCAGA 1 20 
TTCTGGCATC CCAGCCCTCA GTGTCCAAGA CCCAGGCAGC CCGGGTCCCC GCCTCCCGGA 180 
_ _ TCCAGGCGTC CGGGATCTGC GCCACCAGAA CCTAGCCTCC TGCAGACCTC CGCCATCTGG 240 
70 GGGCACTCAA CCTCCTGGAG CCAAGGGCCC CACGTCCCAC CCAGAGAAAC TCTCGTATTC 300 
CCAGCTCCTA GGGCCAAGGA ACCCGGGCGC TCCGAACTCC CAGCTTTCGG ACATCTGGCA 360 
CACGGGGCAG AGCAGAGAAG CTCAGCGCCC AGCCTGGGGA ATTTAAACAC TCCAGCTTCC 420 
AAGAGCCAAG GAACTTCAGT GCTGTGAACT CACAACTCTA AGGAGCCCTC CAAAGTTCCA 480 
_ _ GTCTCCAGGT GCTGTTACTC AACTCAGTCC TAGGAACGTC GGGTCCTGGG AAGGAGCCCA 540 
75 AGCGCTCCCA GCCAGCTTCC AGGCGCTAAG AA ACCCCGGT GCTTCCCATC ATGGTGGCCG 600 
ATCCTCCTCG AGACTCCAAG GGGCTCGCAG CGGCGGAGCC CACCGCCAAC GGGGGCCTGG 660 
CGCTGGCCTC CATCGAGGAC CAAGGCGCGG CAGCAGGCGG CTACTGCGGT TCCCGGGACC 720 
AGGTGCGCCG CTGOCTTCGA GCCAACCTGC TTGTGCTGCT GACAGTGGTG GCCGTGGTGG 780 
CCGGCGTGGC GCTGGGACTG GGGGTGTCGG GGGCCGGGGG TGCGCTGGCG TTGGGCCCGG 840 
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AGCGCTTGAG CGCCTTCGTC TTCCCGGGCG AGCTGCTGCT GCGTCTGCTG CGGATGATCA 900 
TCTTGCCGCT GGTGGTGTGC AGCTTGATCG GCGGCGCCGC CAGCCTGGAC CCCGGCGCGC 960 
TCGGCCGTCT GGGCGCCTGG GCGCTGCTCT TTTTCCTGGT CACCACGCTG CTGGCGTCGG 1020 
CGCTCGGAGT GGGCTTGGCG CTGGCTCTGC AGCCGGGCGC CGCCTCCGCC GCCATCAACG 1080 
CCTCCGTGGG AGCCGCGGGC AGTGCCGAAA ATGCCCCCAG CAAGGAGGTG CTCGATTCGT 1140 
TCCTGGATCT TGCGAGAAAT ATCTTCCCTT CCAACCTGGT GTCAGCAGCC TTTCGCTCAT 1200 
ACTCTACCAC CTATGAAGAG AGG AATATCA CCGGAACCAG GGTGAAGGTG CCCGTGGGGC 1260 
AGGAGGTGGA GGGGATGAAC ATCCTGGGCT TGGTAGTGTT TGCCATCGTC TTTGGTGTGG 1320 
CGCTGCGGAA GCTGGGGCCT GAAGGGGAGC TGCTTATCCG CTTCTTCAAC TCCTTCAATG 1380 
AGGCCACCAT GGTTCTGGTC TCCTGGATCA TGTGGTACGC CCCTGTGGGC ATCATGTTCC 1440 
TGGTGGCTGG CAAGATCGTG GAGATGGAGG ATGTGGGTIT ACTCTTTGCC CGCCTTGGCA 1500 
AGTACATTCT GTGCTGCCTG CTGGGTCACG CCATCCATGG GCTCCTGGTA CTGCCCCTCA 1560 
TCTACTTCCT CTTCACCCGC AAAAACCCCT ACCGCTTCCT GTGGGGCATC GTGACGCCGC 1620 
TGGCCACTGC CTTTGGGACC TCTTCCAGTT CCGCCACGCT GCCGCTG ATG ATG AAGTGCG 1680 
TGGAGGAGAA TAATGGCGTG GCCAAGCACA TCAGCCGTTT CATCCTGCCC ATCGGCGCCA 1740 
CCGTCAACAT GGACGGTGCC GCGCTCTTCC AGTGCGTGGC CGCAGTGTTC ATTGCACAGC 1800 
TCAGCCAGCA GTCCTTGGAC TTCGTAAAGA TCATCACCAT CCTGGTCACG GCCACAGCGT I860 
CCAGCGTGGG GGCAGCGGGC ATCCCTGCTG GAGGTGTCCT CACTCTGGCC ATCATCCTCG 1920 
AAGCAGTCAA CCTCCCGGTC GACCATATCT CCTTGATCCT GGCTGTGGAC TGGCTAGTCG 1980 
ACCGGTCCTG TACCGTCCTC AATGTAGAAG GTGACGCTCT GGGGGCAGGA CTCCTCCAAA 2040 
ATTATGTGGA CCGTACGGAG TCGAGAAGCA CAGAGCCTGA GTTGATACAA GTGAAGAGTG 2100 
AGCTGCCCCT GGATCCGCTG CCAGTCCCCA CTGAGGAAGG AAACCCCCTC CTCAAACACT 2160 
ATCGGGGGCC CGCAGGGGAT GCCACGGTCG CCTCTGAGAA GGAATCAGTC ATGIMACCC 2220 
CGGGAGGGAC CTTCCCTGCC CTGCTGGGGG TGCTCTTTGG ACACTGGATT ATGAGGAATG 2280 
GATAAATGGA TGAGCTAGGG CTCTGGGGGT CTGCCTGCAC ACTCTGGGGA GCCAGGGOCC 2340 
CCAGCACCCT CCAGGACAGG AGATCTGGGA TGCCTGGCTG CTGGAGTACA TGTGTTCACA 2400 
AGGGTTACTC CTCAAAACCC CCAGTTCTCA CTCATGTCCC CAACTCAAGG CTAGAAAACA 2460 
GCAAGATGGA GAAATAATGT TCTGCTGCGT CCCCACCGTG ACCTGCCTGG CCTCCCCTGT 2520 
CTCAGGGAGC AGGTCACAGG TCACCATGGG GAATTCTAGC CCCCACTGGG GGGATGTTAC 2580 
AACACCATGC TGGTTATTTT GGCGGCTGTA GTTGTGGGGG GATGTGTGTG TGCACGTGTG 2640 
TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TTCTGTGACC TCCTGTCCCC ATGGTACGTC 2700 
CCACCCTGTC CCCAGATCCC CTATTCCCTC CACAATAACA GAAACACTCC CAGGGACTCT 2760 
GGGGAGAGGC TGAGGACAAA TACCTGCTGT CACTCCAGAG GACAnTTTT TTAGCAATAA 2820 
AATTGAGTGT CAACTATTTA AAAAAAAAAA AAAAAA 



1 11 21 31 41 51 
I I I I I I 

MVADPPRDSK GLAAAEPTAN GGLALASED QGAAAGGYCG SRDQVRRCLR ANLLVIXTVV 60 
AWAGVALGL GVSGAGGALA LGPERLSAFV FPGELLLRLL RM1ILPLVVC SUGGAASLD 120 
PGALGRLGAW ALLFFLVTTL LASALGVGLA LALQPGAASA AMASVGAAG SAENAPSKEV 180 
LDSFLDLARN IFPSNLVSAA FRSYSTTYEE RNITGTRVKV PVGQEVEGMN ELGLWFAIV 240 
FGVALRKLGP EGEIXIRFFN SFNEATMVLV SWIMWYAPVG IMFLVAGKIV EMEDVGLLFA 300 
RLGKYILCCL LGHAIHGLLV LPLIYFLFTR KNPYRFLWGI VTPLATAFGT SSSSATLPLM 360 
MKCVEENNGV AKHISRFILP 1GATVNMDGA ALFQCVAAVF 1AQLS0QSLD FVKimLVT 420 
ATASSVGAAG 1PAGGVLTLA IILEAVNLPV DUISLILAVD WLVDRSCTVLNVEGDALGAG 480 
LLQNYVDRTE SRSTEPELIQ VKSELPLDPL PVPTEEGNPL LKHYRGPAGD ATVASEKES V 540 



55 SEQ ID N0.-119PFJ3DNA SEQUENCE 

Nucleic Acid Accession*: NM_006708 

Coding sequence: 88-642 (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 

60 | | | | | | 

CTAGTTAAGG CGGCACAGGG CCGAGGCGTA GTGTGGGTGA CTCCTCCGTT CCTTGGGTCC 60 
CGTCGTCTGT GATACTGCAG TTCAGCCAJG. GCAGAACCGC AGCCCCCGTC CGGCGGCCTC 120 
ACGGACGAGG CCGCCCTCAG TTGCTGCTCC GACGCGGACC CCAGTACCAA GGATTTTCTA 180 
TTGCAGCAGA CCATGCTACG AGTGAAGGAT CCTAAGAAGT CACTGGATTT TTATACTAGA 240 

65 GTTCTTGGAA TGACGCTAAT CCAAAAATGT GATTTTCCCA TTATGAAGTT TTCACTCTAC 300 

TTCTTGGCTT ATGAGGATAA AAATGACATC CCTAAAGAAA AAGATGAAAA AATAGCCTGG 360 
GCGCTCTCCA GAAAAGCTAC ACTTGAGCTG ACACACAATT GGGGCACTGA AGATGATGCG 420 
ACCCAGAGTT ACCACAATGG CAATTCAGAC CCTCGAGGAT TCGGTCATAT TGGAATTGCT 480 
GTTCCTGATG TATACAGTGC TTGTAAAAGG TTTGAAGAAC TGGGAGTCAA ATTTGTGAAG 540 

70 AAACCTGATG ATGGTAAAAT GAAAGGCCTG GCATTTATTC AAGATCCTGA TGGCTACTGG 600 
ATTGAAATTT TGAATCCTAA CA AAATGGCA ACCTTAATGT_AG.TGCTGTGA G AATTCTCCT 660 
TTGAOATTTC AGAAGAAAGG AAACAATGTG ATTCAAGATA TTTACATACC AGAAGCATCT 720 
AGGACTGATG GATCACTGTC CCGATTCAAA TTATTCTTCA GTCCATTTCC CCTTCCTATT 780 

_ TCAGCTGTTC CTTTTCACCT AACTGTTCAG TC ATTCTGGT TTTCAAGCAG TGCTTTATCT 840 

75 CATGTCCTTG AATATAGTTG TGTAACTTTA 1 U 1 1 1 AGGT AAT A ATT AG A ACAGTTCCCT 900 
TCAGAGGCTG CATTTGCCTT CTTCTGCCAC CTAAATATTA CTTCCCTTCA AATCTGCCTT 960 
TGAATCATCA TTTTTAAAAA AAAATTAACA TGTTTTTGTT GTAGTTATCT TCTGGGGTTT 1020 
CAATTCCTCA GAAACAACTT TTTTCACAAC GGAAAGGAAA GAACACTAGT GTTCTTTCAG 1080 
TAAAGTACAA AGTGTTTATT TTACAAAAGA GTAGGTACTC TTGAG AGCAA TTCAAATCAT 1 140 
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GCTGACAAGG ATACTGATAG AAAAAGTGAT TTCTTCTTAT TATAAAGTAC ATTTAAAGTT 1200 
CAAGGACTAA CCTTATTTAT TTGGGAAAGG GGAGGAGGAA GGAAATGATA TGGTACCCAG 1260 
ACACTGGGCT AGGCTGCAAC TTTATCTCAT TTAATACTCC CAGCTGTCAT GTGAGAAAG A 1320 
AAGCAGGCTA GGCATGTGAA ATCACTTTCA TGGATTATTA ATGGATTTAA GAGGGCATCA 1380 
3 ATCAGCTCAA CTCAAGATTT CATAATCATT TTTAGTATTT AGATTGTGCC TCAAAGTTGT 1440 
AGTACCTCAC AATACCTCCA CTGGTTTCCT GTTGTAAAAA CCTTCAGTGA GTTTGACCAT 1500 
TGTGCTCTTG GCTCTTGGGC TGGAGTACCG TGGTGAGGGA GTAAACACTA GAAGTCTTTA 1560 
GTACAAAACT GCTCTAGGGA CACCTGGTGA TTCCTACACA AGTGATGTTT ATATTTCTCA 1620 
TAAAGAGTCT TCCCTATCCC AAGGTCTTCA TGATGCCAGT AGCCATATAT GATAAATTAT 1680 

1U GTTCAGTGAT AACT TAGTTA TCAQAAATCA GCTCAGTGGT CTTCCCCGCC ATGATTCACA 1740 
TTTGATGAGT TTTTAAAAAT CAAAGTGATT ITGAAAATCT CTAATGGCTC AGAAAATAAA 1800 
AACATCCAGT TTGTGGATGA CTATATTTAG ATTTCTCTAG ACTCTAGTGG AAGACCTTTG 1860 
GAAAGGCCAT GCCAACCGTG CTTGTACTGC TAGAAGCACT TTATGTTTCC TTTTTGGGTG 1920 
AAATGGATTT ATGTGAGTGC TTTAAACAAA TAGCAATACT TATAGACTGA AATAAAATGA 1980 

15 AACTTCAAAT AAG 



MAEPQPPSGG LTDEAALSCC SDADPSTKDF LLQQTMLRVK DPKKSLDFYT RVLGMTLIQK 60 
CDFPIMKFSL YFLAYEDKND IPKEKDEKIA WALSRKATLE LTHNWGTEDD ATQS YHNGNS 120 
£3 DPRGFGHIGI AVPDVYSACK RFEELGVKFV KKPDDGKMKG LAF1QDPDGY WIEILNPNKM 180 
ATLM 



50 



SEQ ID N0:121 PFJ2 DNA SEQUENCE 

(underlined sequences correspond to slart and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CCGACGCCAG GTCCTGCCGT CCCGCCGACC GTCCGGGAGC GAACCCGTCG TCCCGCACTG 60 
GA GTCC GCGATGGCTTCAGT GACAGATGGT AAACATGGAG TCAAAGATGC CTCTGACCAG 120 
AATTTTGACT ACATGTTTAA ACTGCTTATC ATTGGCAACA GCAGTGTTGG CAAGACCTCC 180 
TrCCTCTTGC GCTATGCTGA TGACACGTTC ACCCCAGCCT TCGTTAGCAC CGTGGGCATC 240 
GACTTCAAG G TGAAGACAGT CTACCGTCAC GAGAAGCGGG TGAAACTGCA GATCTGGGAC 300 
ACAGCTGGGC AGGAGCGGTA CCGGACCATC ACAACAGCCT ATTACCGTGG GGCCATGGGC 360 
TTCATTCTGA TGTATGACAT CACCAATGAA GAGTCCTTCA ATGCTGTCCA AGACTGGGCT 420 
ACTCAGATCA AGACCTACTC CTGGGACAAT GCACAAGTTA TTCTGGTGGG GAACAAGTGT 480 
GACATGGAGG AAG AG AGGGT TGTTCCCACT GAGAAGGGCC AGCTCCTTGC AGAGCAGCTT 540 
GGGTTTGATT TCTTTGAAGC CAGTGCAAAG GAGAACATCA GTGTAAGGCA GGCCTTTGAG 600 
CGCCTGGTGG ATGCCATTTG TGACAAGATG TCTGATTCGC TGGACACAGA CCCGTCGATG 660 
CTGGGCTCCT CCAAGAACAC GCGTCTCTCG GACACCCCAC CGCTGCTGCA GCAGAACTGC 720 
TCATGCTAjSC AAGGCCCACC TTCCTGACCT CCCCTCATTG TGGCCCCACA CCCAAGTCTG 780 
CTTCTCCCTG TTACACACTG TCCGCTCT 



1 II 21 31 41 51 
I I I I I I 

MAS VHX5KHG VKD ASDQNFD YMFKLUIGN SSVGKTSFLL RYADDTFTPA FVSTVGIDFK 60 
VKTVYRHEKR VKLQIWDTAG QKRYRTITTA YYRGAMGFIL MYDITNEESF NAVQDWATQI 120 
KTYSWDNAQV ILVGNKCDME EERWPTEKG QtXAEQLGFD FFEASAKENI SVRQAFERLV 180 
DAICDKMSDS LDTDPSMLGS SKNTRLSDTP PLLQQNCSC 



'5 SEQ ID NO:123 PFJ1 DNA SEQUENCE 
Nucleic Acid Accession*: NM_001844 

Coding sequence: 1544621 (undefined sequences correspond to start and stop codons) 

n 1 11 21 31 41 51 

ACGCAGAGCG CTGCTGGGCT GCCGGGTCTC CCGCTTCCTC CTCCTGCTCC AAGGGCCTCC 60 
TGCATGAGGG CGCGGTAGAG ACCCGGACCC GCGCCGTGCT CCTGCCGTTT CGCTGCGCTC 120 
CGCCCGGGCC CGGCTCAGCC AGGCCCCGCG GTGAGCCATQ ATTCGCCTCG GGGCTCCCCA 180 
_ GTCGCTGGTG CTGCTGACGC TGCTCGTCGC CGCTGTCCTT CGGTGTCAGG GCCAGGATGT 240 
J CCAGGAGGCT GGCAGCTGTG TGCAGGATGG GCAGAGGTAT AATGATAAGG ATGTGTGGAA 300 
GCCGGAGCCC TGCCGGATCT GTGTCTGTGA CACTGGGACT GTCCTCTGCG ACGACATAAT 360 
CTGTGAAGAC GTGAAAGACT GCCTCAGCCC TGAGATCCCC TTCGGAGAGT GCTGCCCCAT 420 
CTGCCCAACT GACCTCGCCA CTGCCAGTGG GCAACCAGGA CCAAAGGGAC AGAAAGGAGA 480 
ACCTGGAGAC ATCAAGGATA TTGTAGG ACC CAA AGG ACCT CCTGGGCCTC AGGGACCTCC 540 
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AGGGGAACAA GGACCCAGAG GGGATCGTGG TGACAAAGGT GAAAAAGGTG CCCCTGGACC 600 
TCGTGGCAGA GATGGAGAAC CTGGGACCCC TGGAAATCCT GGCCCCCCTG GTCCTCCCGG 660 
CCCCCCTGGT CCCCCTGGTC TTGGTGGAAA CTTTGCTGCC CAGATGGCTG GAGGATTTGA 720 
TGAAAAGGCT GGTGGCGCCC AGTTGGGAGT AATGCAAGGA CCAATGGGCC CCATGGGACC 780 
5 TCGAGGACCT CCAGGCCCTG CAGGTGCTCC TGGGCCTCAA GGATTTCAAG GCAATCCTGG 840 
TGAACCTGGT GAACCTGGTG TCTCTGGTCC CATGGGTCCC CGTGGTCCTC CTGGTCCCCC 900 
TGGAAAGCCT GGTGATGATG GTGAAGCTGG AAAACCTGGA AAAGCTGGTG AAAGGGGTCC 960 
GCCTGGTCCT CAGGGTGCTC GTGGTTTCCC AGGAACCCCA GGCCTTCCTG GTGTCAAAGG 1020 
TCACAGAGGT TATCCAGGCC TGGACGGTGC TAAGGGAGAG GCGGGTGCTC CTGGTGTGAA 1080 

10 GGGTGAGAGT GGTTCCCCGG GTGAGAACGG ATCTCCGGGC CCAATGGGTC CTCGTGGCCT 1140 
GCCTGGTGAA AGAGGACGGA CTGGCCCTGC TGGCGCTGCG GGTGCCCGAG GCAACGATGG 1200 
TCAGCCAGGC CCCGCAGGTC CTCCGGGTCC TGTCGGTCCT GCTGGTGGTC CTGGCTTCCC 1260 
TGGTGCTCCT GGAGCCAAGG GTGAAGCCGG CCCCACTGGT GCCCGTGGTC CTGAAGGTGC 1320 

, _ TCAAGGTCCT CGCGGTGAAC CTGGTACTCC TGGGTCCCCT GGGCCTGCTG GTGCCTCCGG 1380 

1 5 TAACCCTGGA ACAGATGGAA TTCCTGGAGC CAAAGGATCT GCTGGTGCTC CTGGCATTGC 1440 
TGGTGCTCCT GGCTTCCCTG GGCCACGGGG TCCTCCTGGC CCTCAAGGTG CAACTGGTCC 1500 
TCTGGGCCCG AAAGGTCAGA CGGGTGAACC TGGTATTGCT GGCTTCAAAG GTGAACAAGG 1560 
CCCCAAGGGA GAACCTGGCC CTGCTGGCCC CCAGGGAGCC CCTGGACCCG CTGGTGAAGA 1620 
AGGCAAGAGA GGTGCCCGTG GAGAGCCTGG TGGCGTTGGG CCCATCGGTC CCCCTGGAGA 1680 

20 AAGAGGTGCT CCCGGAAACC GCGGTTTCCC AGGTCAAGAT GGTCTGGCAG GTCCCAAGGG 1740 
AGCCCCTGGA GAGCGAGGGC CCAGTGGTCT TGCTGGCCCC AAGGGAGCCA ACGGTGACCC 1800 
TGGCCGTCCT GGAGAACCTG GCCTTCCTGG AGCCCGGGGT CTCACTGGCC GCCCTGGTGA 1860 
TGCTGGTCCT CAAGGCAAAG TTGGCCCTTC TGGAGCCCCT GGTGAAGATG GTCGTCCTGG 1920 
ACCTCCAGGT CCTCAGGGGG CTCGTGGGCA GCCTGGTGTC ATGGGTTTCC CTGGCCCCAA 1980 

25 AGGTGCCAAC GGTGAGCCTG GCAAAGCTGG TGAGAAGGGA CTGCCTGGTG CTCCTGGTCT 2040 
GAGGGGTCTT CCTGGCAAAG ATGGTGAGAC AGGTGCTGCA GGACCCCCTG GCCCTGCTGG 2100 
ACCTGCTGGT GAACGAGGCG AGCAGGGTGC TCCTGGGCCA TCTGGGTTCC AGGGACTTCC 21 60 
TGGCCCTCCT GGTCCCCCAG GTGAAGGTGG AAAACCAGGT GACCAGGGTG TTCCCGGTGA 2220 
AGCTGGAGCC CCTGGCCTCG TGGGTCCCAG GGGTGAACGA GGTTTCCCAG GTGAACGTGG 2280 

30 CTCTCCCGGT GCCCAGGGCC TCCAGGGTCC CCGTGGCCTC CCCGGCACTC CTGGCACTGA 2340 
TGGTCCCAAA GGTGCATCTG GCCCAGCAGG CCCCCCTGGC GCACAGGGCC CTCCAGGTCT 2400 
TCAGGGAATG CCTGGCGAGA GGGGAGCAGC TGGTATCGCT GGGCCCAAAG GCGACAGGGG 2460 
TGACGTTGGT GAGAAAGGCC CTGAGGGAGC CCCTGGAAAG GATGGTGGAC GAGGCCTGAC 2520 
AGGTCCCATT GGCCCCCCTG GCCCAGCTGG TGCTAACGGC GAGA AGGGAG AAGTTGGACC 2580 

35 TCCTGGTCCT GCAGGAAGTG CTGGTGCTCG TGGCGCTCCG GGTGAACGTG GAGAGACTGG 2640 
CCCCCCCGGA CCAGCGGGAT TTGCTGGGCC TCCTGGTGCT GATGGCCAGC CTGGGGCCAA 2700 
GGGTGAGCAA GGAGAGGCCG GCCAGAAAGG CGATGCTGGT GCCCCTGGTC CICAGGGCCC 2760 
CTCTGGAGCA CCTGGGCCTC AGGGTCCTAC TGGAGTGACT GGTCCTAAAG GAGCCCGAGG 2820 
TGCCCAAGGC CCCCCGGGAG CCACTGGATT CCCTGGAGCT GCTGGCCGCG TTGGACCCCC 2880 

40 AGGCTCCAAT GGCAACCCTG GACCCCCTGG TCCCCCTGGT CCTTCTGGAA AAGATGGTCC 2940 
CAA AGGTGCT CGAGG AGACA GCGGCCCCCC TGGCCGAGCT GGTGAACCCG GCCTCCAAGG 3000 
TCCTGCTGGA CCCCCTGGCG AGAAGGGAGA GCCTGGAGAT GACGGTCCCT CTGGTGCCGA 3060 
AGGTCCACCA GGTCCCCAGG GTCTGGCTGG TCAGAGAGGC ATCGTCGGTC TGCCTGGGCA 3120 
ACGTGGTGAG AGAGGATTCC CTGGCTTGCC TGGCCCATCG GGTGAGCCCG GCAAGCAGGG 3180 

45 TGCTCCTGGA GCATCTGGAG ACAGAGGTCC TCCTGGCCCC GTGGGTCCTC CTGGCCTGAC 3240 
GGGTCCTGCA GGTGAACCCG GACGAGAGGG AAGCCCCGGT GCTGATGGCC CCCCTGGCAG 3300 
AGATGGCGCT GCTGGAGTCA AGGGTGATCG TGGTGAGACT GGTGCTGTGG GAGCTCCTGG 3360 
AGCCCCTGGG CCCCCTGGCT CCCCTGGCCC CGCTGGTCCA ACTGGCAAGC AAGGAGACAG 3420 
AGGAGAAGCT GGTGCACAAG GCCCC ATGGG ACCCTCAGGA CCAGCTGGAG CCCGGGGAAT 3480 

50 CCAGGGTCCT CAAGGCCCCA GAGGTGACAA AGG AGAGGCT GGAGAGCCTG GCGAGAGAGG 3540 
CCTGAAGGGA CACCGTGGCT TCACTGGTCT GCAGGGTCTG CCCGGCCCTC CTGGTCCTTC 3600 
TGGAGACCAA GGTGCTTCTG GTCCTGCTGG TCCTTCTGGC CCTAGAGGTC CTGCTGGCCC 3660 
CGTCGGTCCC TCTGGCAAAG ATGGTGCTAA TGGAATCCCT GGCCCCATTG GGCCTCCTGG 3720 
TCCCCGTGGA CGATCAGGCG AAACCGGTCC TGCTGGTCCT CCTGGAAATC CTGGGCCCCC 3780 

55 TGGTCCTCCA GGTCCCCCTG GCCCTGGCAT CGACATGTCC GCCTTTGCTG GCTTAGGCCC 3840 

GAGAGAGAAG GGCCCCGACC CCCTGCAGTA CATGCGGGCC GACCAGGCAG CCGGTGGCCT 3900 
GAGACAGCAT GACGCCGAGG TG GATGCCAC ACTCAAGTCC CTCAACAACC AGATTGAGAG 3960 
CATCCGCAGC CCCGAGGGCT CCCGCAAGAA CCCTGCTCGC ACCTGCAGAG ACCTGAAACT 4020 
CTGCCACCCT GAGTGGAAGA GTGGAGACTA CTGGATTGAC CCCAACCAAG GCTGCACCTT 4080 

60 GGACGCCATG AAGGTTTTCT GCAACATGGA GACTGGCGAG ACTTGCGTCT ACCCCAATCC 4140 

AGCAAACGTT CCCAAGAAGA ACTGGTGGAG CAGCAAGAGC AAGGAGAAGA AACACATCTG 4200 
GTTTGGAGAA ACCATCAATG GTGGCTTCCA TTTCAGCTAT GGAGATGACA ATCTGGCTCC 4260 
CAACACTGCC AACGTCCAGA TGACCTTCCT ACGCCTGCTG TCCACGGAAG GCTCCCAGAA 4320 
CATCACCTAC CACTGCAAGA ACAGCATTGC CTATCTGGAC GAAGCAGCTG GCAACCTCAA 4380 

65 GAAGGCCCTG CTCATCCAGG GCTCCAATGA CGTGGAGATC CGGGCAGAGG GCAATAGCAG 4440 
GTTCACGTAC ACTGCCCTGA AGGATGGCTG CACGAAACAT ACCGGTAAGT GGGGCAAGAC 4500 
TGTTATCGAG TACCGGTCAC AGAAGACCTC ACGCCTCCCC ATCATTGACA TTGCACCCAT 4560 
GGACATAGGA GGGCCCGAGC AGGAATTCGG TGTGGACATA GGGCCGGTCT GCTTCTTGTA 4620 
AAAACCTGAA CCCAGAAACA ACACAATCCG TTGCAAACCC AAAGGACCCA AGTACTTTCC 4680 

70 AATCTCAGTC ACTCTAGGAC TCTGCACTGA ATGGCTG ACC TGACCTGATG TCCATTCATC 4740 
CCACCCTCTC ACAGTTCGGA CTTTTCTCCC CTCTCTTTCT AAGAGACCTG AACTGGGCAG 4800 
ACTGCAAAAT AAAATCTCGG TGTTCTATTT ATTTATTGTC TTCCTGTAAG ACCTTCGGGT 4860 
CAAGGCAGAG GCAGGAAACT AACTGGTGTG AGTCAAATGC CCCCTGAGTG ACTGCCCCCA 4920 
GCCCAGGCCA GAAGACCTCC CTTCAGGTGC CGGGCGCAGG AACTGTGTGT GTCCTACACA 4980 

75 ATGGTGCTAT TCTGTGTCAA ACACCTCTGT ATTTTTTAAA ACATCAATTG ATATTAAAAA 5040 
TGAAAAGATT ATTGGAAAGT 



SFn ID NO:124 PF.MProteln sequence 
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Protein Accession I: NP_001835.2 



c I I I I I I 

J MKLGAPQSL VLLTLLVAAV LRCQGQDVQE AGSCVQDGQR YNDKDVWKPE PCRICVCDTG 60 
TVLCDDUCE DVKDCLSPEI PFGECCPICP TDLATASGQP GPKGQKGEPG DIKDIVGPKG 120 
PPGPQGPAGE QGPRGDRGDK GEKGAPGPRG RDGEPGTPGN PGPPGPPGPP GPPGLGGNFA 180 
AQMAGGFDEK AGGAQLG VMQ GPMGPMGPRG PPGPAGAPGP QGFQGNPGEP GEPGVSGPMG 240 
PRGPPGPPGK PGDDGEAGKP GKAGERGPPG PQGARGFPGT PGLPGVKGHR GYPGLDGAKG 300 
10 EAGAPGVKGE SGSPGENGSP GPMGPRGLPG ERGRTGPAGA AGARGNDGQP GPAGPPGPVG 360 
PAGGPGFPGA PGAKGEAGPT GARGPEGAQG PRGEPGTPGS PGPAGASGNP GTDGIPGAKG 420 
SAGAPGIAGA PGFPGPRGPP GPQGATGPLG PKGQTGEPGI AGFKGEQGPK GEPGPAGPQG 480 
APGPAGEEGK RGARGEPGGV GPIGPPGERG APGNRGFPGQ DGLAGPKGAP GERGPSGLAG 540 
PKGANGDPGR PGEPGLPGAR GLTGRPGDAG PQGKVGPSGA PGEDGRPGPP GPQGARGQPG 600 
1 5 VMGFPGPKGA NGEPGKAGEK GLPGAPGLRG LPGKDGETGA AGPPGPAGPA GERGEQGAPG 660 
PSGFQGLPGP PGPPGEGGKP GDQGVPGEAG APGLVGPRGE RGFPGERGSP GAQGLQGPRG 720 
LPGTPGTDGP KGASGPAGPP GAQGPPGLOG MPGERGAAGI AGPKGDRGDV GEKGPEGAPG 780 
KDGGRGLTGP IGPPGPAGAN GEKGEVGPPG PAGSAGARGA PGERGETGPP GPAGFAGPPG 840 
„ ADGQPGAKGE QGEAGQKGDA GAPGPQGPSG APGPQGPTGV TGPKGARGAQ GPPGATGFPG 900 
20 AAGRVGPPGS NGNPGPPGPP GPSGKDGPKG ARGDSGPPGR AGEPGLQGPA GPPGEKGEPG 960 
DDGPSGAEGP PGPQGLAGQR GIVGLPGQRG ERGFPGLPGP SGEPGKQGAP GASGDRGPPG 1020 
PVGPPGLTGP AGEPGREGSP GADGPPGRDG AAGVKGDRGE TGAVGAPGAP GPPGSPGPAG 1080 
PTGKQGDRGE AGAQGPMGPS GPAGARGIQG PQGPRGDKGE AGEPGERGLK GHRGFTGLQG 1 140 
_ LPGPPGPSGD QGASGPAGPS GPRGPPGPVG PSGKDGANGI PGPIGPPGPR GRSGETGPAG 1100 

25 PPGNPGPPGP PGPPGPGIDM SAFAGLGPRE KGPDPLQYMR ADQAAGGLRQ HDAEVDATLK 1260 
SLNNQDESIR SPEGSRKNPA RTCRDLKLCH PEWKSGDYWI DPNQGCTLDA MKVFCNMETG 1320 
ETCVYPNPAN VPKKNWWSSK SKEKKHIWFG ETINGGFHFS YGDDNLAPNT ANVQMTFLRL 1380 
LSTEGSQNIT YHCKNSIAYL DEAAGNLKKA LLIQGSNDVE IRAEGNSRFT YTALKDGCTK 1440 
HTGKWGKTVI EYRSQKTSRL PIIDIAPMDI GGPEQEFGVD IGPVCFL 



SEQ ID HO:125 PFH9 DNASEQUENCE 

Nucleic Acid Accessions NM.O05084 

Coding sequence: 162-1487(underlined sequences correspond to slart and stop codons) 

1 11 21 31 41 51 
I I I I ! I 

GCTGGTCGGA GGCTCGCAGT GCTGTCGGCG AGAAGCAGTC GGGTTTGGAG CGCTTGGGTC 60 
GCGTTGGTGC GCGGTGGAAC GCGCCCAGGG ACCCCAGTTC CCGCGAGCAG CTCCGCGCCG 120 
CGCCTGAGAG ACTAAGCTGA AACTGCTGCT CAGCTCCCAA GATGGTGCCA CCCAAATTGC 180 
ATGTGCTTTT CTGCCTCTGC GGCTGCCTGG CTGTGGTTTA TCCTTTTGAC TGGCAATACA 240 
TAAATCCTGT TGCCCATATG AAATCATCAG CATGGGTCAA CAAAATACAA GTACTGATGG 300 
CTGCTGCAAG CTTTGGCCAA ACTAAAATCC CCCGGGG AAA TGGGCCTTAT TCCGTTGGTT 360 
GTACAGACTT AATGTTTGAT CACACTAATA AGGGCACCTT CTTGCGTTTA TATTATCCAT 420 
CCCAAGATAA TGATCGCCTT GACACCCTTT GGATCCCAAA TAAAGAATAT TTTTGGGGTC 480 
TTAGCAAATT TCTTGGAACA CACTGGCTTA TGGGCAACAT TTTGAGGTTA CTCTTTGGTT 540 
CAATGACAAC TCCTGCAAAC TGGAATTCCC CTCTGAGGCC TGGTGAAAAA TATCCACTTG 600 
TTGTTTTTTC TCATGGTCTT GGGGCATTCA GGACACTTTA TTCTGCTATT GGCATTGACC 660 
TGGCATCTCA TGGGTTTATA GTTGCTGCTG TAGAACACAG AGATAGATCT GCATCTGCAA 720 
CTTACTATTT CAAGGACCAA TCTGCTGCAG AAATAGGGGA CAAGTCTTGG CTCTACCTTA 780 
GAACCCTGAA AC AAGAG GAG GAGACACATA TACGAAATGA GCAGGTACGG CAAAGAGCAA 840 
AAGAATGTTC CCAAGCTCTC AGTCTGATTC TTGACATTGA TCATGGAAAG CCAGTGAAGA 900 
ATGCATTAGA TTTAAAGTTT GATATGGAAC AACTGAAGGA CTCTATTGAT AGGGAAAAAA 960 
TAGCAGTAAT TGG ACATTCT TTTGGTGGAG CAACGGTTAT TCAGACTCTT AGTGAAGATC 1020 
AGAGATTCAG ATGTGGTATT GCCCTGGATG CATGGATGTT TCCACTGGGT GATGAAGTAT 1080 
ATTCCAGAAT TCCTCAGCCC CTCTTTTTTA TCAACTCTGA ATATTTCCAA TATCCTGCTA 1140 
ATATCATAAA AATGAAAAAA TGCTACTCAC CTGATAAAGA AAGAAAGATG ATTACAATCA 1200 
GGGGTTCAGT CCACCAGAAT TCTGCTGACT TCACTTTTGC AACTGGCAAA ATAATTGGAC 1260 
ACATGCTCAA ATTAAAGGGA GACATAGATT CAAATGTAGC TATTGATCTT AGCAACAAAG 1320 
CTTCATTAGC ATTCTTACAA AAGCATTTAG GACTTCATAA AGATTTTGAT CAGTGGGACT 1380 
GCTTGATTGA AGGAGATGAT GAGAATCTTA TTCCAGGGAC CAACATTAAC ACAACCAATC 1440 
AACACATCAT GTTACAGAAC TCTTCAGGAA TAGAGAAATA CAATTAG.GAT TAAAATAGGT 1500 



MVPPKLHVLF CLCGCLAVVY PFDWQYINPV AHMKSSAWVN KIQVLMAAAS FGQTKIPRGN 60 
GPYSVGCTDL MFDHTNKGTF LRLYYPSQDN DRLDTLWIPN KEYFWGLSKF LGTHWLMGNI 120 
LR1XFGSMTT PANWNSPLRP GEKYPLVVFS HGLGAFRTLY SAIGIDLASH GFIVAAVEHR 180 
DRSASATYYF KDQSAAE1GD KSWLYLRTLK QEEETHIRNE QVRQRAKECS QALSLILDID 240 
HGKPVKNALD LKFDMEQLKD SIDREKIAVI GHSFGGATVI QTLSEDQRFR CG1ALDAWMF 300 
PLGDEVYSRI PQPLFF1NSE YFQYPANIIK MKKCYSPDKE RKMITKGSV HQNFADFTFA 360 
TGKI1GHMUC LKGDIDSNVA IDLSNKASLA FLQKHLGLHK DFDQWDCLIE GDDENL1PGT 420 
NINTTNQHIM LQNSSGIEKY N 
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SEQ ID NO:127 PFH8 DNA SEQUENCE 

Nucleic Acid Accession #: NM.01 5900 

Coding sequence: 32-1402 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CACGAGCGGC ACGAGGATTT CCAGCTCAGC GATGCCCCCA GGTCCCTGGG AGAGCTGCTT 60 
CTGGGTGGGG GGCCTCATTT TGTGGCTCAG CGTTGGAAGT TCAGGGGATG CACCTCCTAC 1 20 
CCCACAGCCA AAGTGCGCTG ACTTCCAGAG CGCCAACCTT TTTGAAGGCA CCGATCTCAA 180 
AGTCCAGTTT CTCCTCTTTG TCCCTTCGAA TCCTAGCTGT GGGCAGCTAG TAGAAGGAAG 240 
CAGTGACCTC CAAAACTCTG GGTTCAATGC CACTCTGGGA ACCAAACTAA TTATCCATGG 300 
ATTCAGGGTT TTAGGAACAA AGCCTTCCTG GATTGACACA TTTATTAGAA CCCTTCTGCG 360 
TGCAACGAAT GCTAATGTGA TTGCCGTGGA CTGGATTTAT GGGTCTACAG GAGTCTACTT 420 
CTCAGCTGTG AAAAATGTGA TTAAGTTGAG CCTCG AGATC TCCCnTTCC TCAATAAACT 480 
CCTGGTGCTG GGTGTGTCGG AATCCTCAAT CC ACATCATT GGTGTTAGCC TGGGGGCCCA 540 
CGTTGGGGGC ATGGTGGGAC AGCTCTTCGG AGGCCAGCTG GGACAGATCA CAGGCCTGGA 600 
CCCCGCTGGA CCTGAGTACA CCAGGGCCAG TGTGGAAGAG CGCTTGGATG CTGGAGATGC 660 
CCTCTTCGTG GAAGCCATCC ACACAGACAC CGACAATTTG GGTATTCGGA TTCCCGTTGG 720 
ACATGTGGAC TACTTCGTCA ACGGAGGCCA AGACCAACCT GGCTGCCCCA CCTTCTTTTA 780 
CGCAGGTTAT AGTTATCTGA TCTGTGATCA CATGAGGGCT GTGCACCTCT ACATCAGCGC 840 
CCTGGAGAAT TCCTGTCCAC TGATGGCCTT TCCCTGTGCC AGCTACAAGG CCTTCCTTGC 900 
TGGACGCTGT CTGGATTGCT TTAACCCTTT TCTGCTTTCC TGCCCAAGGA TAGGACTGGT 960 
GGAACAAGGT GGTGTCAAGA TAGAGCCGCT CCCCAAGGAA GTGAAAGTCT ACCTCCTGAC 1020 
TACTTCCAGT GCTCCGTACT GCATGCATCA CAGCCTCGTG GAGTTTCACT TGAAGGAACT 1080 
GAGAAACAAG GACACCAACA TCGAGGTTAC CTTCCTTAGC AGTAACATCA CCTCTTCATC 1140 
TAAGATCACC ATACCTAAGC AGCAACGCTA TGGGAAAGGA ATCATAGCCC ATGCCACCCC 1200 
ACAATGCCAG ATAAACCAAG TGAAATTCAA GTTTCAGTCT TCCAACCGAG TTTGGAAAAA 1260 
AGACCGGACT ACCATTATTG GGAAGTTCTG CACTGCCCTT TTGCCTGTCA ATGACAGAGA 1320 
AAAGATGGTC TGCTTACCTG AACCAGTGAA CTTACAAGCA AGTGTGACTG TTTCCTGTGA 1380 
CCTGAAGATA GCCTGTGTGT.AGTTTAACCT GGGCAGGACA CATCTCCCTG CATTTTTTTT 1440 
TTTTTTTTTT GAGAGAGAGG TGTGATGAGG GATGTGTGTG TGCAGCTTAT TGTAGACCAT 1500 
TACTACTA AG GAGAAAAGCA AAGCTCTTTC TTATTTTCCT CATAATCAGC TACCCTGGAG 1560 
GGGAGGGAGA ACTCATTTTA CAGAACTTGG TTTCCTTTGC CGATCTTATG TACATACCCA 1620 
TTTTAGCTTT CCCATGCATA CTTAACTGCA CTTGCTTTAT CTCCTTGGGC ATTCGTACTT 1680 
AGG A1TCAAT AGAAACATGT ACAGGGTAAA CAATTTTTTA A, 
AAAAAAAAAA AJ 



1 11 21 31 41 51 
I I I I I I 

MPPGPWESCF WVGGLILWLS VGSSGDAPPT PQPKCADFQS ANLFEGTDLK VQFLLFVPSN 60 
PSCGQLVEGS SDLQNSGFNA TLGTKLBHG FRVLGTKPSW IDTFIRTLLR ATNANVIAVD 120 
WIYGSTGVYF SAVKNVIKLS LEISLFLNKL LVLGVSESSI HIIGVSLGAH VGGMVGQLFG 180 
GQLGQITGLD PAGPEYTRAS VEERLDAGDA LFVEAIHTDT DNLGIRIPVG HVDYFVNGGQ 240 
DQPGCPTFFY AGYSYLICDH MRAVHLYISA LENSCPLMAF PCAS YKAFLA GRCLDCFNPF 300 
LLSCPRIGLV EQGGVKIEPL PKEVKVYLLT TSSAPYCMHH SLVEFHLKEL RNKDTNIEVT 360 
FLSSNITSSS K3TIPKQQRY GKGIIAHATP QCQINQVKFK FQSSNRVWKK DRTTIIGKFC 420 
TALLPVNDRE KMVCLPEPVN LQASVTVSCD LKIACV 



SEQ ID NO:129 PFH7 DNA SEQUENCE 

Nucleic Acid Accession*: NMJ14384 
60 Coding sequence: 89-1 336 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

65 CGTTGCCGGG TCGCAGGTCC CGCCAGTGCG AGCGCAACGG AGGTCGAAGG CGTTCAGACT 60 
CTTAGCTGAA CGCGGAGCTG CGGCGGCTAXQCTGTGGAGC GGCTGCCGGC GTTTCGGGGC 120 
GCGCCTCGGC TGCCTGCCCG GCGGTCTCCG GGTCCTCGTC CAGACCGGCC ACCGGAGCTT 1 80 
GACCTCCTGC ATCGACCCTT CCATGGGACT TAATGAAGAG CAGAAAGAAT TTCAAAAAGT 240 
GGCCTTTGAC TTTGCTGCCC GAGAGATGGC TCCAAATATG GCAGAGTGGG ACCAGAAGGA 300 

70 GCTGTTCCCA GTGGATGTGA TGCGGAAGGC AGCCCAGCTA GGCTTCGGAG GGGTCTACAT 360 
ACAAACAGAT GTGGGCGGGT CTGGGCTGTC ACGTCTTGAT ACCTCTGTCA TTTTTGAAGC 420 
CTTGGCTACA GGCTGCACCA GCACCACAGC CTATATAAGC ATCCACAACA TGTGTGCCTG 480 
GATGATTGAT AGCTTCGGAA ATGAGGAACA GAGGCACAAA TTTTGCCCAC CGCTCTGTAC 540 
CATGGAGAAG TTTGCTTCCT ACTGCCTCAC TGAACCAGGA AGTGGGAGTG ATGCTGCCTC 600 

75 TCTTCTGACC TCCGCTAAGA AACAGGGAGA TCATTACATC CTCAATGGCT CCAAGGCCTT 660 
CATCAGTGGT GCTGGTGAGT CAGACATCTA TGTGGTCATG TGCCGA ACAG GAGGACCAGG 720 
CCCCAAGGGC ATCTCATGCA TAGTTGTTGA GAAGGGGACC CCTGGCCTCA GCTTTGGCAA 780 
GAAGGAGAAA AAGGTGGGGT GGAACTCCCA GCCAACACGA GCTGTGATCT TCGAAGACTG 840 
TGCTGTCCCT GTGGCCAACA GAATTGGGAG CGAGGGGCAG GGCTTCCTCA TTGCCGTGAG 900 
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AGGACTGAAC GG AGGGAGGA TCAATATTGC TTCCTGCTCC CTGGGGGCTG CCCACGCCTC 960 
TGTCATCCTC ACCCGAGACC ACCTCAATGT CCGGAAGCAG TTTGGAGAGC CTCTGGCCAG 1020 
TAACCAGTAC TTGCAATTCA CACTGGCTGA TATGGCAACA AGGCTGGTGG CCGCGCGGCT 1080 
GATGGTCCGC AATGCAGCAG TGGCTCTGCA GOAGGAGAGG AAGGATGCAG TGGCCTTGTG 1 140 
CTCCATGGCC AAGCTCTTTG CTACAGATGA ATGCTTTGCC ATCTGCAACC AGGCCTTGCA 1200 
GATGCACGGG GGCTACGGCT ACCTGAAGGA TTACGCTGTT CAGCAGTACG TGCGGGACTC 1260 
CAGGGTCCAC CAGATTCTAG AAGGTAGCAA TGAAGTGATG AGGATACTGA TCTCTAGAAG 1320 
CCTGCTTCAG GAGTAGAACC CACACTTGTT CTGGCCTGGT GTTCAGTGCG ACTGCAGTCA 1380 
GTGTTGAGTG GTGCCATGTG GGCCGCTCTA TTCCAAAGGA ATCATGGATT AGACCCAAGG 1440 
GCTGAGCTCC TCTAGGGCAG GACCTGCACC CTGTGTGTTG GCACCAGCAT CGGGTCTTGG 1500 
ACTGGGGCAG AATCCCCAGT GGAACCGGAA GAGCTGGACT GATGAGAAAC ATCAGAAGAA 1560 
CACATACTAC CTTGTTTTCC TAATGCCAGA AGGGTGACCA GTGAAGATTC ACCGTCAAAC 1620 
CATGAAAGTC CTTTCTTGGA TCCACTTTAT C1TGATTAGT CTGCATTTTA CTAGTTCACT 1680 
GGATCCCTCC TCTAGGGGCC TGGGGACTTT CACTGATGCT CTTCCTGATT CTAGAGCAAA 1740 

TACAG ATGCA GAAGGTTTCT CTGGATAGCA CACCTCTGAA TGTAAATCAT GATAAAATGG 1 860 
ATATTTG G A A ACTTACTCCT AAGCTGTGAT GTAGGGTGTA TTTCTACTTC TGGACTGCCT 1920 
CAATATCAAG GGCTGAGACT TTTGAATGTT GAATATTCGT TGGGTTTCAT GTTAAGACGC 1980 
CTGTGGTCCA GGAGTGCTAT TCAGTGTTTC TGTTCCTGAT AAACACTTTG AATA 1111I1 2040 
TGTGTTTTTG TTTCCTTTTC TGAAGCTGTT CCTCCTTTTA AATATITTTA ATCACATTGA 2100 
TAAAATCTAT CCTTCATCCA CCTCTGGTTC TACTATAGTT GATTTTTATT TTAAATGTTT 2160 
AATTGTATTT GATTAAACAC TTAACTGGAT TTTGGAATAA TAAAACTCTC GTCCAATTTG 2220 
GCTtTTAAAA AAAAAAAA 

SEQ ID H0:130 PFH7 Protein sentience: 



1 11 21 31 41 51 
I I I I I I 

MLWSGCRRFG ARLGCLPGGL RVLVQTGHRS LTSCIDPSMG LNEEQKEFQK VAFDFA AREM 60 
APNMAEWDQK ELFPVDVMRK AAQLGFGGVY IQTDVGGSGL SRLDTSV1FE ALATGCTSTT 120 
AYISIHNMCA WMIDSFGNEE QRHKFCPPLC TMEKFASYCL TEPGSGSDAA SLLTSAKKQG 180 
DHYILNGSKA FISGAGESDI YVVMCRTGGP GPKGISCIVV EKGTPGLSFG KKEKKVGWNS 240 
QPTRAVIFED CAVPVANRIG SEGQGFLIAV RGLNGGR1NI ASCSLGAAHA SVILTRDHLN 300 
VRKQFGEPLA SNQYLQFTLA DMATRLVAAR LMVRNAAVAL QEERKDAVAL CSMAKLFATD 360 
ECFAICNQAL QMHGGYGYLK DYAVQQYVRD SRVHQILEGS NEVMRHJSR SLLQE 



SEQ ID N0:131 PFH6 DNA SEQUENCE 

NMJJ13989 

Coding sequence: 707-1 1 05(undeitined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAAGGGAAA GACAGAGAGT 60 
GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 120 
AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 180 
GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 
CATAGAG ACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 300 
TTGGAAATCA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGGTA ACTGGATTAG 360 
GGACGAGTAC GCCAGCTTTT TTTTTTTTTT TTTTTTTTTT TTTAACATCT TAAATCCTGA 420 
AAAAAAAAAA AAAAAAAAAA AAAAGGCAGC AGCTCCGAAT TGAATGAATT GATGGGCACA 480 
CTCCAACTGC TGGGCTGGAG AGACTGGACT TAGTCTTGCC ATTTCTGCTT CTTTGAAAGA 540 
GGAGACAACT TGGGCTTCCT TTTAATTTAG TTTTTTTT CC CCTTCTCCCC CAACCCCCAA 600 
CCTTCCCCCT TACCTCCCCC ACCCCCTTTA TCACCACCCC CCTTTTA AAT AAGAGGGTGA 660 
AGGGGAACCA GAGCGCACAA GGGAACTGAC TCAGGAGGCA GAGAAGATGG GC ATCCTCAG 720 
CGTAGACTTG CTGATCACAC TGCAAATTCT GCCAGTTTTT TTCTCCAACT GCCTCTTCCT 780 
GGCTCTCTAT GACTCGGTCA TTCTGCTCAA GCACGTGGTG CTGCTGTTGA GCCGCTCCAA 840 
GTCCACTCGC GGAGAGTGGC GGCGCATGCT GACCTCAGAG GGACTGCGCT GCGTCTOGAA 900 
GAGCTTCCTC CTCGATGCCT ACAAACAGGT GAAATTGGGT GAGGATGCCC CCAATTCCAG 960 
TGTGGTGCAT GTCTCCAGTA CAGAAGGAGG TG ACAACAGT GGCAATGGTA CCCAGGAGAA 1020 
GATAGCTGAG GGAGCCACAT GCCACCTTCT TGACTTTGCC AGCCCTGAGC GCCCACTAGT 1080 
GGTCAACTTT GGCTCAGCCA CT1QACCTCC TTTCACGAGC CAGCTGCCAG CCTTCCGCAA 1140 
ACTGGTGGAA GAGTTCTCCT CAGTGGCTGA CTTCCTGCTG GTCTACATTG ATGAGGCTCA 1200 
TCCATCAGAT GGCTGGGCGA TACCGGGGGA CTCCTCTTTG TCTTTTGAGG TGAAGAAGCA 1260 
CCAGAACCAG GAAG ATCGAT GTGCAGCAGC CCAGCAGCTT CTGGAGCGTT TCTCCTTGCC 1320 
GCCCCAGTGC CGAGTTGTGG CTGACCGCAT GGACAATAAC GCCAACATAG CTTACGGGGT 1380 
AGCCTTTGAA CGTGTGTGCA TTGTGCAGAG ACAGAAAATT GCTTATCTGG GAGGAAAGGG 1440 
CCCCTTCTCC TACAACCTTC AAGAAGTCCG GCATTGGCTG GAGAAGAATT TCAGCAAGAG 1500 
ATGAAAGAAA ACTAGATTAG CTGGTTAAAG GTATGATTAT AAGAGAGCTT ATTGTTTTAA 1560 
AAAGTTATAT AAAGGCAAGG AAATTAAGAA CTGAATCCAT ATTTCAACAG AGCCCTATTG 1620 
GCTTACTGAA AGACAGGAGT TTATCTATCG GAAGAACATG AATCTCTAAC AGCTCCATAC 1680 
TTCTTTCACT ACTCAAATGG CATTGGGCTG AGTAAGTAAC CATATCACCT CTCTTCTTAG 1740 
TAAAAAGCCC TATGTGAAAA GATCCCAAGA TGGAGAGGAA GAAACGCTAA TTCAGCATGT 1800 
GTTCATTCTG CATTGAGAAG GAACTGATAC ATCTGATGCA TGCTTTGAGA CCAGAAGAAA I860 
AGACTTACCT GAATAATTAC TACATTAGGG AAGCTACTGT CTACGTTAAG ATAAAGGGTA 1920 
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TTGCCTTGGC TCTATTTGGC ATGGATGGAG CCCAGTTGGA AAATTCCCAA ATATTACAAC 1980 
AAGTCCTTGA ACCCAGGCCA TGTGGTTAGA CGTTGGTGTT AAGGTTAGAC CTTATGTTAG 2040 
AGTCATTTCT G ATGTTCCAG CTTCTAGCCA TGTAGTGCTC TCAGTCTTCA TACCCCAGAA 2100 
ATTATTGGTA TATTTGTAGA TACCGAGAAT GATCCCTCAG TCTGAGAGGT TAGAATGATC 2 160 
5 ATCTGTAATC TGAGGGTTAA TTTCTAGGCA GGTGGAGAGA GTGGTAAAAA AGAAATGAAA 2220 
TTGACAAGCT AGGAAAGAGG AGGCAGAAAG ATTTGGAAAA TTCACAGAGT TTCACCCTTA 2280 
AGCTGTAGAG AGTGGGTCAC ATTTGTTAGC CACGGAAACA TAGAAACATA CACAAGGCCA 2340 
GAAAAAGAAG AAGGAGCTCA ACTAAAAGTG GCATAGAGAA TACACATATA AAAACAATAT 2400 
ATTTGTCATA TGCTCCTAGA GAGGAGAAAG GGGTGATTGA AAGAAAAAAA AATACTTAAA 2460 
1 U TATTTGTAAT TGTGAGGGGT TTCTTTTGGA AATAATTACT TTTGAACCAT GTATGTGGTA 2520 
TGTATATTTT CAGTGGGTTA ATTATACCCC ATGATACCTA TTAAAGGAAA ACCAGTGGGT 2580 
CTGGTGGTGC TGGTCTTITC CTCCCCATTC CTACAATTTC TATGTGGCCC AAGTCATTCC 2640 
TAATCTTGGT CTCTATAGCA GTGTTCTCTC TGAATGCTGA GCTGAAGAAA TTATACGTAC 2700 
ATACACACAT ACATACATAC ATACAAATAT ATGTATATAT ATTCTCAGCT GCTGCGGGAG 2760 

15 GTAGGTACCA TGGCCATTCA GCACAGCCTT GATTTCCTCC CAAAGTAGGT GAGCTATAGT 2820 
GAAGAATAGG TGCAAACAAA CAAGCTTACT TCCATTGCAA AATAGAAGAA GAGGAAGTTA 2880 
GAGATAATTC TGATCAATCA TTTTGGAGGC TTTGTTATAA GGCAACCCCC GGTATATCAT 2940 
GGAATTTCCA TTGACATTTG AATTTGGACT TGGATCTTCC CTTGGTCCCA TTAGCTGAGG 3000 
TTTAGTAATC TAAAGTCCCT ATAGTATATG ATTATAATGC TATTTTAAAA AATATATATA 3060 

20 TAAAATATTT TTITCTTTTT AAAATAGACA CTATAGTTTT ACCCATAAGT AATATTTAAA 3120 
GATTATAGCT CCCAAAAG AA TGG ACCAACC ACTTTCGTAT CATAATTTCT TTTTGGTAAA 3 1 80 
TATGAGACTA TTATGAAATC ATAGTATATG ATTGTATTTA AAGGTACAAT CAAAGGATCT 3240 
TTTGTCCATT CCATTAATAA CTGAATAAAA AATAAATAAA ATGGATAGAA AAAAACTAAA 3300 
GTTGAAAATA CATTCTTAAA CTAGTTGTCT GAAATGAGAA AAGAGTGAGA ACTAGGTGTG 3360 

25 CAAGAACCAA ACGTATTTTA TTTTATnTT TAAATGGGAG CAACATATCA GTCGTGTCAC 3420 
CAGCTGGTAT ATTGTGTAAA TATTAAAGCT CCATTGGGAC TGATTTTTCA TGGCAACATC 3480 
AGCTTTCTAA TGTTCTAAAT TCTATAAAAA CCACCCACAA AGAAACAAAG CAAATTTCAT 3540 
TATCTAATGA GTTGCTGGAA AATCATATTG AGAATAATTA TTTCAGATTC CTCAGTTGTT 3600 
AACTTCTACA TTCAAGGGCT TATCTCTGCC CCCATTGATT TTTAACCTCA AAATGGTGTG 3660 

30 AGATTTACTG TGGAACCCTA AAGCAGTAAA ATAAAAAACC TGGTTGCAGC ACATTCACAC 3720 
TGTTGTCCTT AAAATTCCCC TTTTTTCTCT ATGTACGATA AAGTAACAGT ATGTCAGATA 3780 
AGCCGGTGGG GGGATGAGAT TAGGCTGAGG CAGTGCTAGT CAACTGGGGG AAAAGGATGA 3840 
TGGAAAAATC ACCCAGTTGT GCTATATTTT TAAAGAAGG A GGTCGTTTAT GTGTGCAGAC 3900 
AATTCTCCCT GAGGTTAGCC CAATGGAGAA ATGAAGCAGA GGAAGGAAAC ATAGAAAGAC 3960 

35 ATGGGCTATC AGGG AGGAAG ATGTTCAATA GAACATGCAA GAATTTCTGG AAGAAAGGCT 4020 
GTGGAAGGGC CAATGGAGAA AATGAATGGA CAAAGCTCAG GAATCCCTAC OCTATGTAGA 4080 
ATGTTCTTGG TGTTATCAGG GTTAAGCCCT GTAATTATGT AACCTATTTA TCGCAACATG 4140 
AATTT7TATG ATTTCTTGTG ATGTATTCTT TTATGAAATT AACAAGAACT CATTATTTTG 4200 
AGGTAGAGGA AAATCAATGC TTTATCTG AT ATGCTGAGAA ATTATTAGAT TGCCAATACT 4260 

40 CATGTGCGTT TCATGTGTTT TATA AGGTTT GTTCCTTTGA AGAATTGTAG TTCTTAGTCC 4320 
CACAGGGAAA TGTGTATCTA TTTATATATC ATAGTATAAA TCTATGATAT ATTTATATCA 4380 
TATATAAAAG TCTGAGTTCT CTTTCTTAGT CCCTA ATC AT GTTTCTCCC A TAGGCTGTGT 4440 . 
TTACATGGAG CTATCGGTTT AGCCTTTTAA GCTTCATTAG CTTGTCTATT ATTGAAATAG 4500 
TTTCCAAGAA ATTTTAG ATA TTATCATAAC ATCTGGGTCT ACTCAAACAC TTATTGTTTG 4560 

45 AAAGACTTAT GTCTTGGACC TATCAAAAAC TGACTTTATT TATTGCTTAG TGAAAATACT 4620 
AGTGGGATCA ACAATG ATTT TCTTGA ATGG GCATGAATGG AGATGCCCGC ACAGTAATGT 4680 
AGAAATGTTT CATACAGCTA TTAAAATGTA ACTGACCTCC TTAGAGGCAG ATTAGTAACT 4740 
GTTCCTACTT TGTATAGCTA AGTGACAGTC ACTTAACTTA CATGACTTTC TTTTTTCACA 4800 
TTGGGTCTCT GGTCCTGTGT CTTCACCTCA TTTATAGCAC GTCTCCTTGA TTTTTGGTAG 4860 

5 0 TATCAACTTC CCAGTG ATCT GTTC AGTTAA GTTCTTCTCC CGTTAACCAG G AAGTGCTTA 4920 
TTCTCTCATC ACAGTGGGAA GAATAGCCTA TTGTCTTTCA TTTTGCCTGA GTGTATTTTA 4980 
CTATTTGGGC TCTGAAATAA AAATTATGAA ATATGGTGAG GTCACATGTT GGTGCTGCCT 5040 
TGCTGCATAA AATTCTAGGA GGGCAGGTTA GGAGACAGTT ATGTATGGCC TTTCGGGAAA 5100 
ATTCAAAGGG TGGGATTACA AGGGTGTTCC TCAGGCATGC CCCTATGGGC CCTATGTGGA 5160 

55 AGCAAGAAG A ATTG ACTGAT TTACAGG ACT TCTCTTTATG TCAATCTTAA GAGGATGGAT 5220 
GAATCTGGAC ATTTGTTCCA CCCGACCTCT GACTGATGGT TTGGAAAATA ACTTTAATTA 5280 
GGATCATATG ACCATTGAAA AAGGAAAAAT GTAGACTCTG ACTTCCGTCC CACTGAAGGA 5340 
TTAATGAAAA CCTTTACTAG CATTTAGAGC TTTTCAGAAC ATCCCCACTG TCATGTGTCT 5400 
CAGCAGTGGA GACTGCAAGT AAGGCTTTTA ATTTTAGGAG GTTTTTTTTT TTTTTTTTTT 5460 

60 TTCCCCTAAA TGGTATGGCC AAAAGTCAGA GTTAAAATAT ATATAGTTAG ATTCGAACTT 5520 
CCTCCTTCAC TCTAAAAATA GAATCCAAAC CCACTCTTCA TATATGCTTC CAGAATGGGG 5580 
CTTAAGTACC AATCTCTGCT TTGCAATGGG CACAATCTTG GTCATGTCCT GAGGCTCTCT 5640 
AAGAAAAGAG AGGATCTAGG ATGGGAGAGC TAGAAAGTTG CTAACTGGGA AGAACAAGGC 5700 
CCTGAGGGGT TGGTCTACC A ATCTGGGAAG ATTTG AAAAC A AACTTCTCG CAACTGAAGG 5760 

65 AAGGCTGAAG GCTGCTGCAA GTCATTGAGT GACTTTAGGA TGAGCAAAAC ATTGGGCCAC 5820 
TTCCTAATGC CCTATGTGTA TAGTACCAGA AGCAAGGTCT CAGACTTAAC AGACCCAGCT 5880 
CTGTTCCAAG GTGAGTCTGA ACCAATAGAA AGCAAACATG TGCAGATATC CAAACAAGAC 5940 
TGCTCATGCA AGTCGGGGCT GGCTACCCGT CTTAGGCAGC AAC AGCAGAG CTCCAGGGAG 6000 
CTTATTCAAT ATTTACTGAG ACTTCGAAGA CCCAGCAGAT GTTTAATGAA GTCACTATTT 6060 

70 TGGCTCAAAC CCTCCACTTC TCCCCCTCCC CTCAAAAAGC CAACAGGTAA ACACATAAAT 6120 
GAAAGAAACC CACAGAAGGG GATGGGAAAT AAAGAAAATT CTCTCAAGAC TTCTCCAGGC 6180 
CCATGTC ACT GGTCAGCGTG GTTTTTATGT GTATTAGGAT TGGGGGATGT GAAGAAATAA 6240 
GTATCCAGTA CTTTATAACC AAAGCAATTA AATGATATTG GGGTAGGGAA TGTTGGCCAG 6300 
TTTTGTTTAG TTTTGCCATC ACATTGTCAC CC AG ACCTC A CCTAGCCCCA AGTAATCGGG 6360 

7 5 CGCCCCGA AG AGGG AG ACAG AGATGTGCC A GAGTTGACCC AGTGTGCGGA TG ATAACTAC 6420 
TGACGAAAGA GTCATCGACC TCAGTTAGTG GTTGGATGTA GTCACATTAG TTTGCCTCTC 6480 
CCCATCTTTG TCTCCCTGGC AAGGAGAATA TGCGGGACAT GATGCTAAGA GCCCTGGGTA 6540 
AATGTGGTGA GAATGCACGC GTGCATATGC TACACATATG TGCTTCTCAG TTGCAGAAAA 6600 
TGAACTGCTT TGGG AGATTA TCAGTAGAAA GAGTGTTATC ATATTGGTGC TGAGTGCTAT 6660 
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GTGTGCTTAT ACAATTTGTT CTTGTATTTT AATAAACTTT GAATAAAAGA ATAAAAAAAA 6720 
AAAAAAAAAA AAAAA 



5 SEQIDNO:132PFH6 Prolelnseq Ue nc e: 
Protein Accession* NP.054644.1 

1 11 21 31 41 51 

10 MGILSVDLLI TLQILPVFFS NCLFLALYDS VILLKHVVLL LSRSKSTRGE WRRMLTSEGL 60 

RCVWKSFLLD AYKQVKLGED APNSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP 120 
ERPLVVNFGS ATXPPFTSQL PAFRKLVEEFSSVADFLLVY IDEAHPSDGW AIPGDSSLSF 180 
EVKKHQNQED RCAAAQQLLE RFSLPPQCRV VADRMDNNAN IAYGVAFERV CP/QRQKIAY 240 
LGGKGPFSYN LQEVRHWLEK NFSKRXKKTR LAG 



SEQ ID N0:133 PFH5 DNA SEQUENCE 

20 Nucleic Acid Accession ft NM.001141 

Coding sequence: 72-2102 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
-.111111 

25 CAGGCGTGTC CCAGGGGGAG CCCCGCTCTG CAGCCCTGTG CGCCGTAGAG AGCTGGACTT 60 

AGGCTGGCAG CMSGCCGAG TTC AGGGTCA GGGTGTCCAC CGGAGAAGCC TTCGGGGCTG 120 - 
GCACATGGGA CAAAGTGTCT GTCAGCATCG TGGGGACCCG GGGAGAGAGC CCCCCACTGC 180 
CCCTGGACAA TCTCGGCAAG GAGTTCACTG CGGGCGCTGA GGAGGACTTC CAGGTGACGC 240 
TCCCGGAGGA CGTAGGCCGA GTGCTGCTGC TGCGCGTGCA CAAGGCGCCC CCAGTGCTGC 300 

30 CCCTGCTGGG GCCCCTGGCC CCGGATGCCT GGTTCTGCCG CTGGTTCCAG CTGACACCGC 360 
CGCGGGGCGG CCACCTCCTC TTCCCCTGCT ACCAGTGGCT GGAGGGGGCG GGGACCCTGG 420 
TGCTGCAGGA GGGTACAGCC AAGGTGTCCT GGGCAGACCA CCACCCTGTG CTCCAGCAAC 480 
AGCGCCAGGA GGAGCTTCAG GCCCGGCAGG AGATGTACCA GTGGAAGGCT TACAACCCAG 540 
GTTGGCCTCA CTGCCTGGAT GAAAAGACAG TGGAAGACTT GGAGCTCAAT ATCAAATACT 600 

35 CCACAGCCAA GAATGCCAAC TTTTATCTAC AAGCTGGCTC TGCTTTTGCA GAGATGAAAA 660 
TCAAGGGGTT GCTGGACCGC AAGGGGCTCT GGAGGAGTCT GAATGAGAl'G AAAAGGATCT 720 
TCAACTTCCG GAGGACCCCA GCAGCTGAGC ACGCATTTGA GCACTGGCAG GAGGATGCCT 780 
TCTTCGCCTC CCAGTTCCTG AATGGTCTCA ACCCTGTCCT GATCCGCCGC TGTCACTACC 840 
TCCCAAAGAA CTTCCCCGTC ACTG ATGCCA TGGTGGCCTC ATTGTTGGGT CCTGGGACCA 900 

40 GCTTGCAGGC TGAGCTAGAG AAGGGCTCCC TGTTCTTGGT GGATCACGGC ATCCTCTCTG 960 
GCATCCAGAC CAATGTCATT AATGGGAAGC CGCAGTTCTC TGCGGCCCCA ATGACCCTGC 1020 
TATACCAGAG CCCAGGCTGC GGGCCGCTGC TGCCTCTCGC CATCCAGCTC AGCCAGACCC 1080 
CCGGCCCAAA CAGCCCCATC TTCCTGCCCA CTGATGACAA GTGGGACTGG TTGCTGGCCA 1 140 
. _ AGACCTGGGT GCGCAATGCC GAGTTCTCCT TCCATGAGGC CCTCACGCAC CTGCTGCACT 1200 

45 CACATCTGCT GCCTGAGGTC TTCACCCTGG CTACCCTGCG TCAGCTGCCC CACTGCCACC 1260 
CTCTCTTCAA GCTGCTGATC CCGCACACCC GATACACCCT GCACATCAAC ACACTCGCCC 1320 
GGGAGCTGCT TATCGTGCCA GGGCAGGTGG TGGACAGGTC CACAGGCATC GGCA'ITGAAG 1380 
GCTTCTCTGA GTTGATACAG AGGAACATGA AGCAGCTGAA CTATTCTCTC CTGTGTCTGC 1440 
CTGAGGATAT CCGGACCCGA GGAGTTGAAG ACATCCCAGG CTACTACTAC CGTGATGATG 1500 

50 GGATGCAGAT TTGGGGTGCA GTGGAACGCT TTGTCTCTGA AATCATCGGT ATCTACTACC 1560 
CAAGTGATGA GTCTGTCCAA GATGACAGAG AGCTCCAGGC CTGGGTCAGA GAGATCTTCT 1620 
CCAAGGGCTT CCTAAACCAG GAGAGCTCAG GTATCCCTTC CTCACTGGAG ACCCGGGAAG 1680 
CCCTGGTGCA GTATGTCACC ATGGTGATAT TCACCTGCTC AGCCAAGCAT GCGGCTGTCA 1740 
GTGCAGGGCA GTTTGACTCC TGTGCTTGGA TGCCCAACCT GCCACCCAGC ATGCAGCTGC 1800 

55 CACCACCCAC CTCCAAAGGC CTGGCAACAT GCGAGGGCTT CATAGCCACC CTCCCACCTG 1860 
TCAATGCCAC ATGTGATGTC ATCCTTGCTC TCTGGTTGCT GAGCAAGGAG CCTGGAGACC 1920 
AAAGGCCCCT GGGCACCTAT CCGGATGAGC ACTTCACAGA GGAGGCCCCT CGGCGGAGCA 1980 
TCGCCACCTT CCAGAGCCGC CTGGCCCAGA TCTCGAGGGG CATCCAGGAG CGGAACCGGG 2040 
GCCTCGTGCT GCCCTACACC TACCTAGACC CTCCCCTCAT CGAGAACAGC GTCTCCATCT 2100 

60 AAATCCCAGG GGAACACAGG CCCAGATGAC ATCCCTTTGA CCACATCGCT CTAGGATAAC 2160 
TGGCACCCAG AGAAAAGGAC TCCTCAGAAA AAACAGGCCC CCATGTGCCT CTCCTGGGAC 2220 
AACCAGACTC TGTAACTCAC CCCCACCACC ATACACACAC ACAAAAACAG AAACAAAATC 2280 
AAAACAG AGA AAGCAGAAAA TCTACCAAGA ACAGAGTCTC AGGACAGAAC CACTGAGTCT 2340 

„ TTTGGAGGCT CCAAGCCTCA AAGTGCCCGC AGAGCCCACC TTGAGGGTTT TGCTAGTTGG 2400 

65 TTTTGTnTG CGTTTACAGC CGTGGGGGGA AGCACATAAT CCCGCCCCAG GGCCCACTAG 2460 
CATCCACTGA TTGGACCTTA TGGTCACCCA ACTCAAGGAC AGCCACCAAG AAGTGGCTGC 2520 
CAAAGAGACT GGGCGCAGTG GCTCATGCCC ATAATCCCAG CACTTTGGGA GATGGAGGCG 2580 
GGAAAATCAT TTGAGGTCAG AAGTTCAAGG CCAGCCTGGA CGACATAGCG AGACTCCACC 2640 
TCTACCAAAA AATAAAAATT AAAAAACAAA AAAAAAAAAA AAAAA 



SEQ ID MO;134 PFH5 Protein sequence: 
Protein Accession* NP_001132.1 

75 1 11 21 31 41 51 

MAEFRVRVST GEAFGAGTWD KVSVSIVGTR GESPPLPLDN LGKEFTAGAE EDFQVTLPED 60 
VGRVLLLRVH KAPPVLPLLG PLAPDAWFCR WFQLTPPRGG HLLFPCYQWL EGAGTLVLQE 120 
GTAKVSWADH HPVLQQQRQE ELQARQEMYQ WKAYNPGWPH CLDEKTVEDL ELNIKYSTAK 180 



351 



WO 02/30268 



PCT/US01/32045 



NANFYLQAGS AFAEMKIKGL LDRKGLWRSL NEMKRIFNFR RTPAAEHAFE HWQEDAFFAS 240 
QFLNGLNPVL IRRCHYLPKN FPVTDAMVAS LLGPGTSLQA ELEKGSLFLV DHGILSGIQT 300 
NVINGKPQFS AAPMTLLYQS PGCGPLLPLA IQLSQTPGPN SPIFLPTDDK WDWLLAKTWV 360 
RNAEFSFHEA LTHLLHSHLL PEVFTLATLR QLPHCHPLFK LLIPHTRYTL HMTLARELL 420 
IVPGQVVDRS TGIGIEGFSE UQRNMKQLN YSLLCLPEDI RTRGVEDIPG YYYRDDGMQI 480 
WGAVERFVSE DGIYYPSDE SVQDDRELQA WVREIFSKGF LNQESSGIPS SLETREALVQ 540 
YVTMVIFTCS AKHAAVSAGQ FDSCAWMPNL PPSMQLPPPT SKGLATCEGF IATLPPVNAT 600 
CDVILALWLL SKEPGDQRPL GTYPDEHFTE EAPRRSIATF QSRLAQISRG IQERNRGLVL 660 
PYTYLDPPLIENSVSI 

SEQ ID N0:135 PFH4 DNA SEQUENCE 

Nucleic Acid Accession* NM.002742 

Coding sequence: 236-2974 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GAATTCCTTC TCTCCTCCTC CTCGCCCTTC TCCTCGCCCT CCTCCTCCTC CTCGCCCTCC 60 
CCTCCCGATC CTCATCCCCT TGCCCTCCCC CAGCCCAGGG ACTTTTCCGG AAAGTTTTTA 120 
TTTTCCGTCT GGGCTCTCGG AGAAAGAAGC TCCTGGCTCA GCGGCTGCAA AACTTTCCTG 180 
CTGCCGCGCC GCCAGCCCCC GCCCTCCGCT GCCCGGCCCT GCGCCCCGCC GAGCGATQAG 240 
CGCCCCTCCG GTCCTGCGGC CGCCCAGTCC GCTGCTGCCC GTGGCGGCGG CAGCTGCCGC 300 
AGCGGCCGCC GCACTGGTCC CAGGGTCCGG GCCCGGGCCC GCGCCGTTCT TGGCTCCTGT 360 
CGCGGCCCCG GTCGGGGGCA TCTCGTTCCA TCTGCAGATC GGCCTGAGCC GTGAGCCGGT 420 
GCTGCTGCTG CAGGACTCGT CCGGGGACTA CAGCCTGGCG CACGTCCGCG A( 
CTCCATTGTC GACCAGAAGTTCCCTGAATG TGGTTTCTAC GG A ATGTATG A' 

GcrnrrcGC catgacccta cctctgaaaa catccttcag ctggtgaaag cggccagtga 600 

TATCCAGGAA GGCGATCTTA TTGAAGTGGT CTTGTCACGT TCCGCCACCT TTGAAGACTT 660 
TCAGATTCGT CCCCACGCTC TCITTGTTCA TTCATACAGA GCTCCAGCTT TCTGTGATCA 720 
CTGTGGAGAA ATGCTGTGGG GGCTGGTACG TCAAGGTCTT AAATGTGAAG GGTGTGGTCT 780 
GAATTACCAT AAGAGATGTG CATTTAAAAT ACCCAACAAT TGCAGCGGTG TGAGGCGGAG 840 
AAGGCTCTCA AACGTTTCCC TCACTGGGGT CAGCACCATC CGCACATCAT CTGCTGAACT 900 
CTCTACAAGT GCCCCTGATG AGCCCCTTCT GCAAAAATCA CCATCAGAGT CGTTTATTGG 960 
TCGAGAGAAG AGGTCAAATT CTCAATCATA CATTGGACGA CCAATTCACC TTGACAAGAT 1020 
TTTGATGTCT AAAGTTAAAG TGCCGCACAC ATTTGTCATC CACTCCTACA CCCGGCCCAC 1080 
AGTGTGCCAG TACTGCAAGA AGdTCTGAA GGGGCTTTTC AGGCAGGGCI TGCAGTGCAA 1 140 
AGATTGCAGA TTCAACTGCC ATAAACGTTG TGCACCGAAA GTACCAAACA ACTGCCTTGG 1200 
CGAAGTGACC ATTAATGGAG ATTTGCTTAG CCCTGGGGCA GAGTCTGATG TGGTCATGGA 1260 
AGAAGGGAGT GATGACAATG ATAGTGAAAG GAACAGTGGG CTCATGGATG ATATGGAAGA 1320 
AGCAATGGTC CAAGATGCAG AGATGGCAAT GGCAGAGTGC CAGAACGACA GTGGCGAGAT 1380 
GCAAGATCCA GACCCAGACC ACGAGGACGC CAACAGAACC ATCAGTCCAT CAACAAGCAA 1440 
CAATATCCCA CTCATGAGGG TAGTGCAGTC TGTCAAACAC ACGAAGAGGA AAAGCAGCAC 1500 
AGTCATGAAA GAAGGATGGA TGGTCCACTA CACCAGCAAG GACACGCTGC GGAAACGGCA 1560 
CTATTGGAGA TTGGATAGCA AATGTATTAC CCTCTTTCAG AATGACACAG GAAGCAGGTA 1620 
CTACAAGGAA ATTCCTTTAT CTGAAATTTT GTCTCTGGAA CCAGTAAAAA CTTCAGCTTT 1680 
AATTCCTAAT GGGGCCAATC CTCATTGTTT CGAAATCACT ACGGCAAATG TAGTGTATTA 1740 
TGTGGGAGAA AATGTGGTCA ATCCTTCCAG CCCATCACCA AATAACAGTG TTCTCACCAG 1800 
TGGCGTTGGT GCAGATGTGG CCAGGATGTG GGAGATAGCC ATCCAGCATG CCCTTATGCC 1860 
CGTCATTCCC AAGGGCTCCT CCGTGGGTAC AGGAACCAAC TTGCACAGAG ATATCTCTGT 1920 
GAGTATTTCA GTATCAA ATT GCCAGATTCA AG AAAATGTG GAC ATCAGCA CAGTATATCA 1980 
GATTTTTCCT GATGAAGTAC TGGGTTCTGG ACAGTTTG G A ATTGTTTATG GAGGAAAACA 2040 
TCGTAAAACA GGAAGAGATG TAGCTATTAA AATCATTGAC AAATTACGAT TTCC AAC AAA 2100 
ACAAGAAAGC CAGCTTCGTA ATGAGGTTGC AATTCTACAG AACCTTCATC ACCCTGGTGT 2160 
TGTAAATTTG G AGTGTATGT TTG AGACGCC TGAAAGAGTG TTTGTTGTTA TGGAAAAACT 2220 
CCATGGAGAC ATGCTGGAAA TGATCTTGTC AAGTGAAAAG GGCAGGTTGC CAGAGCACAT 2280 
AACGAAGTTT TTAATTACTC AGATACTCGT GGCTTTGCGG CACCTTCATT TTAAAAATAT 2340 
CGTTCACTGT GACCTCAAAC CAGAAAATGT GTTGCTAGCC TCAGCTGATC CTTTTCCTCA 2400 
GGTGAAACTT TGTGATTTTG GTTTTGCCCG GATCATTGGA GAGAAGTCTT TCCGGAGGTC 2460 
AGTGGTGGGT ACCCCCGCTT ACCTGGCTCC TGAGGTCCTA AGGAACAAGG GCTACAATCG 2520 
CTCTCTAGAC ATGTGGTCTG TTGGGGTCAT CATCTATGTA AGCCTAAGCG GCACATTCCC 2580 
ATTTAATGAA GATGAAGACA TACACGACCA AATTCAGAAT GCAGCTTTCA TGTATCCACC 2640 
AAATCCCTGG AAGGAAATAT CTCATGAAGC CATTGATCTT ATCAACAATT TGCTGCAAGT 2700 
AAAAATGAGA AAGCGCTACA GTGTGGATAA GACCTTGAGC CACCCTTGGC TACAGGACTA 2760 
TCAGACCTGG TTAGATTTGC GAGAGCTGGA ATGCAAAATC GGGGAGCGCT ACATCACCCA 2820 
TGAAAGTGAT GACCTGAGOT GGGAGAAGTA TGCAGGCGAG CAGCGGCTGC AGTACCCCAC 2880 
ACACCTGATC AATCCAAGTG CTAGCCACAG TGACACTCCT GAGACTGAAG AAACAGAAAT 2940 
GAAAGCCCTC GGTGAGCGTG TCAGCATCCT CTQAGTTCCA TCTCCTATAA TCTGTCAAAA 3000 
CACTGTGGAA CTAATAAATA CATACGGTCA GGTTTAACAT TTGCCTTGCA GAACTGCCAT 3060 
TATTTTCTGT CAGATGAGAA CAAAGCTGTT AAACTGTTAG CACTGTTGAT GTATCTGAGT 3120 
TGCCAAGACA AATCAACAGA AGCATTTGTA TTTTGTGTGA CCAACTGTGT TGTATTAACA 3180 
AAAGTTCCCT G A A ACACGA A ACTTGTTATT GTG AATG ATT CATGTTATAT TTAATGCATT 3240 
AAACCTGTCT CCACTGTGCC TTTGCAAATC AGTGTTTTTC TTACTGGAGC TTCATTTTGG 3300 
TAAGAGACAG AATGTATCTG TGAAGTAGTT CTGTTTGGTG TGTCCCATTG GTGTTGTCAT 3360 
TGTAAACAAA CTCTTGAAGA GTCGATTATT TCCAGTGTTC TATGAACAAC TCCAAAACCC 3420 
ATGTGGGAAA AAAATGAATG AGGAGGGTAG GGA ATAA AAT CCTAAGACAC AAATGCATGA 3480 
ACAAGTTTTA ATGTATAGTT TTGA ATCCTT TGCCTGCCTG GTGTGCCTCA GTATATTTAA 3540 
ACTCAAGACA ATGCACCTAG CTGTGCAAGA CCTAGTGCTC TTAAGCCTAA ATGCCTTAGA 3600 
AATGTAAACT GCCATATATA ACAGATACAT TTCCCTCTTT CTTATAATAC TCTGTTGTAC 3660 
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5 SEQIDWIWPFWPrcteinjwwncg: 
Protein Accession*: NP.002733.1 



1 11 21 31 41 51 
I 



MSAPPVLRPP SPLLPVAAAA AAAAAALVPG SQPOPAPFLA PVAAPVGGIS FHLQIGLSRE 60 
PVLLLQDSSG DYSLAHVREM ACSIVDQKFP ECGFYGMYDK ILLFRHDPTS ENILQLVKAA 120 
SDIQEGDLIE VVLSRSATFE DFQIRPHALF VHSYRAPAFC DHCGEMLWGL VRQGLKCEGC 180 
GLNYHKRCAF KIPNNCSGVR RRRLSNVSLT GVSTIRTSSA ELSTSAPDEP LLQKSPSESF 240 
IGREKRSNSQ S YIGRP1HLD K1LMSKVKVP HTFVIHS YTR PTVCQYCKKL LKGLFKQGLQ 300 
CKDCRFNCHK RCAPKVFNNC LGEVTINGDL LSPGAESDVV MEEGSDDNDS ERNSGLMDDM 350 
EEAMVQDAEM AMAECQNDSG EMQDPDPDHE D ANRTISPST SNNIPLMRVV QSVKHTKRKS 420 
STVMKEGWMV HYTSKDTLRK RHYWRLDSKC ITLFQNDTGS RYYKEIPLSE ILSLEPVKTS 480 
ALIPNG AMPH CFEITTANVV YYVGENVVNP SSPSPNNSVL TSGVGADVAR MWE1AIQHAL 540 
MPVIPKGSSV GTGTNLHRDI SVSISVSNCQ IQENVDISTV YQIFPDEVLG SGQFGIVYGG 600 
KHRKTGRDVA IKHDKLRFP TKQESQLRNE V AILQNLHHP GWNLECMFE TPERVFVVME 660 
KLHGDMLEMI LSSEKGRLPE HITKFL1TQ1 LVALRHLHFK NIVHCDLKPE NVLLASADPF 720 
PQVKLCDFGF ARIIGEKSFR RSVVGTPAYL APEVLRNKGY NRSLDMWS VG VIIYVSLSGT 780 
FPFNEDEDIH DQIQNAAFMY PPNPWKEISH EAIDLINNLL QVKMRKRYSV DKTLSHPWLQ 840 
DYQTWLDLRE LECKIGERYI THESDDLRWE KYAGEQRLQY PTHLINPSAS HSDTPETEET 900 
EMKALGERVS IL 



30 SEQ ID NO:137PFH3DNA SEQUENCE 

Nucleic Acid Accession*: X95425 

Coding sequence: 71 2-3825 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

35 | | | | | | 

AATGGTCAGT CAATACATTA TAACATAATA CACCAAATGC TAGAATAGAA GGGGAGGGGG 60 
GCACACATAA TGACTCACTG CTGGAAGAAG GGTGCATCAG TGAATTAAAA AATGTCCCTC 120 
CCCTCTTCAG CACTCAGCGC GCAGCTATTT CCTTCTGCCA GTCTCTTTGA ACTCTGGATC 180 
TTTGCTTTTG CTCGCTGCTC TCCTGTTTTT CATTCTCCAC ATTTTCTCAA TCCTCTTTCT 240 

40 TTATCCTTAG CCACCCTGCT TTTTTCCTCC TnTTTAAAA AATCGGAGAT TTCGTCTTAA 300 
AATGATTTGT CTTCCTTACC TTCGTCCATT TCAACACTGA AGGCTGCAAA GAACTTCACC 360 
TTTCCCCTAG TGGTATTTAA AAATTCTCAA TCCGTAAAAA GTCTTTTTGA AAGGCAAAGG 420 
AACAGGACCC AGACCCTCTC GACACCCTTG ATCCGAGTCA GATCTGCACT AGCAACCAGA 480 
., ACTAATATTT CATTTA ACCC ACCAAAAGGG GG AGGCGAGA GGAGCCAGAA GCAAACTTCA 540 

45 TCTGTCTCAG ACGGATCCGT GGTTCCTACA TTTGGAGGAG CCGCGTGTCA GAAGGCGTAG 600 
GACCCCAAGG GGGGACAAGG AGGACTCCCG AGTCTCCCTT CTCCGCTCTC CGAGACCGAA 660 
GAGGTGGACT GAGCCGCTCG GGACAGCGGC ACCGGAGGAG GCTCGGAGAA GATGCGGGGC 720 
TCGGGGCCCC GGGGTGCGGG ACACCGGCGG CCCCCAAGCG GCGGCGGCGA CACCCCCATC 780 
ACCCCAGCGT CCCTGGCCGG CTGCTACTCT GCACCTCGAC GGGCTCCCCT CTGGACGTGC 840 

50 CTTCTCCTGT GCGCCGCACT CCGGACCCTC CTGGCCAGCC CCAGCAACGA AGTGAATTTA 900 
TTGGATTCAC GCACTGTCAT GGGGGACCTG GGATGGATTG CTTTTCCAAA AAATGGGTGG 960 
GAAGAGATIG GTGAAGTGGA TGAAAATTAT GCCCCTATCC ACACATACCA AGTATGC A AA 1020 
GTGATGGAAC AGAATCAGAA TAACTGGCTT TTGACCAGTT GGATCTCCAA TGAAGGTGCT 1080 
TCCAGAATCT TCATAGAACT CAAATTTACC CTGCGGGACT GCAACAGCCT TCCTGGAGGA 1140 

55 CTGGGGACCT GTAAGGAAAC CTTTAATATG TATTACTTTG AGTCAGATGA TCAGAATGGG 1200 
AGAAACATCA AGGAAAACCA ATACATCAAA ATTGATACCA TTGCTGCCGA TGAAAGCTTT 1260 
ACAGAACTTG ATCTTGGTGA CCGTGTTATG AAACTGAATA CAGAGGTCAG AGATGTAGGA 1320 
CCTCTAAGCA AAAAGGGATT TTATCTTGCT TTTCAAGATG TTGGTGCTTG CATTGCTCTG 1380 
GTTTCTGTGC GTGTATACTA TAAAAAATGC CCTTCTGTGG TACGACACTT GGCTGTCTTC 1440 

60 CCTGACACCA TCACTGGAGC TGATTCTTCC CAATTGCTCG AAGTGTCAGG CTCCTGTGTC 1500 
AACCATTCTG TGACCGATGA ACCTCCCAAA ATGCACTGCA GCGCCGAAGG GGAGTGGCTG 1560 
GTGOCCATCG GGAAATGCAT GTGCAAGGCA GGATATGAAG AGAAAAATGG CACCTGTCAA 1620 
GTGTGCAGAC CTGGGTTCTT CAAAGCCTCA CCTCACATCC AG AGCTGCGG CAAATGTCCA 1680 

, _ CCTCACAGTT ATACCC ATG A GG AAGCTTCA ACCTCTTGTG TCTGTG AAAA GGATTATTTC 1740 

65 AGGAGAGAGT CTGATCCACC CACAATGGCA TGCACAAGAC CCCCCTCTGC TCCTCGGAAT 1800 
GCCATCTCAA ATGTTAATGA AACTAGTGTC TTTCTGGAAT GGATTCCGCC TGCTGACACT 1860 
GGTGGAAGGA AAGACGTGTC ATATTATATT GCATGCAAGA AGTGCAACTC CCATGCAGGT 1920 
GTGTGTGAGG AGTGTGGCGG TCATGTCAGG TACCTTCCCC GGCAAAGCGG CCTGAAAAAC 1980 
ACCTCTGTCA TGATGGTGGA TCTACTCGCT CACACAAACT ATACCTTTGA GATTGAGGCA 2040 

70 GTGAATGGAG TGTCCGACTT GAGCCCAGGA GCCCGGCAGT ATGTGTCTGT AAATGTAACC 2100 
ACAAATCAAG CAGCTCCATC TCCAGTCACC AATGTGA AAA AAGGGAAAAT TGCAAAAAAC 2160 
AGCATCTCTT TGTCTTGGCA AGAACCAGAT CGTCCCAATG GAATCATCCT AGAGTATGAA 2220 
ATCAAGCATT TTG AAAAGGA CCAAGAGACC AGCTACACGA TTATCAAATC TAAAGAGACA 2280 

_ _ ACTATTACTG CAGAGGGCTT GAAACCAGCT TCAGTTTATG TCTTCCAAAT TCGAGCACGT 2340 

75 ACAGCAGCAG GCTATGGTGT CTTCAGTCGA AGATTTGAGT TTGAAACCAC CCCAGTGTTT 2400 
GCAGCATCCA GCGATCAAAG CCAGATTCCT GTAATTGCTG TGTCTGTGAC AGTAGGAGTC 2460 
ATTTTGTTGG CAGTGGTTAT CGGCGTCCTC CTCAGTGGAA GTTGCTGCGA ATGTGGCTGT 2520 
GGGAGGGCTT CTTCCCTGTG CGCTGTTGCC CATCCAATCC TAATATGGCG GTGTGGCTAC 2580 
AGCAAAGCAA AACAAGATCC AGAAGAGGAA AAGATGCATT TTCATAATGG GCACATTAAA 2640 
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CTGCCAGGAG TAAGAACTTA CATTGATCCA CATACCTATG AGGATCCCAA TCAAGCTGTC 2700 
CACGAATTTG CCAAGGAGAT AGAAGCATCA TGTATCACCA TTGAGAGAGT TATTGGAGCA 2760 
GGTGAATTTG GTGAAGTTTG TAGTGGACGT TTGAAACTAC CAGGAAAAAG AGAATTACCT 2820 
GTGGCTATCA AAACCCTTA A AGTAGGCTAT ACTGAAAAGC AACGCAGAGA TTTCCTAGGT 2880 
5 GAAGCAAGTA TCATGGGACA GTTTGATCAT CCTAACATCA TCCATTTAGA AGGTGTGGTG 2940 
ACCAAAAGTA AACCAGTGAT GATCGTGACA GAGTATATGG AGAATGGCTC TTTAGATACA 3000 
TTTTTGAAGA AAAACGATGG GCAGTTCACT GTGATTCAGC TTGTTGGCAT GCTGAGAGGT 3060 
ATCTCTGCAG GAATGAAGTA CCTTTCTGAC ATGGGCTATG TG CAT AG AG A TCTTGCTGCC 3120 
AGAAACATCT TAATCAACAG TAACCTTGTG TGCAAAGTGT CTGACTTTGG ACTTTCCCGG 3180 

10 GTACTGGAAG ATGATCCCGA GGCAGCCTAC ACCACAAGGG GAGGAAAAAT TCCAATCAG A 3240 
TGGACTGCCC CAGAAGCAAT AGCTTTCCGA AAGTTTACTT CTGCCAGTGA TGTCTGGAGT 330O 
TATGGAATAG TAATGTGGGA AGTTGTGTCT TATGGAGAGA GACCCTACTG GGAGATGACC 3360 
AATCAAGATG TGATTAAAGC GGTAGAGGAA GGCTATCGTC TGCCAAGCCC CATGGATTGT 3420 
CCTGCTGCTC TCTATCAGTT AATGCTGGAT TGCTGGCAGA AAGAGCGAAA TAGCAGGCCC 3480 

15 AAGTTTGATG AAATAGTCAA CATGTTGGAC AAGCTGATAC GTAACCCAAG TAGTCTGAAG 3540 
ACGCTGGTTA ATGCATCCTG CAGAGTATCT AATTTATTGG CAGAACATAG CCCACTAGGA 3600 
TCTGGGGCCT ACAGATCAGT AGGTGA ATGG CTAGAGGCAA TCAAGATGGG CCGGTATACA 3660 
GAGATTTTCA TGGAAAATGG ATACAGTTCA ATGGACGCTG TGGCTCAGGT G ACCTTG GAG 3720 
GATTTGAGAC GGCTTGGAGT GACTCTTGTC GGTCACCAGA AGAAGATC AT GAACAGCCTT 3780 

20 CAAGAAATG A AGGTGCAGCT GGTAAACGGA ATGGTGCCAT TGTAACTTCA TGTAAATGTC 3840 
GCTTCTTCAA GTGAATGATT CTGCACTTTG TAAACAGCAC TGAGATTTAT TTTAACAAAA 3900 
AAA 



25 




30 1 11 21 31 41 51 
I I I I I I 

MRGSGPRGAG HRRPPSGGGD TPITPASLAG CYSAPRRAPL WTCLLLCAAL RTLLASPSNE 60 
VNLLDSRTVM GDLQWIAFPK NGWEEIGEVD ENYAPIHTYQ VCKVMEQNQN NWLLTS WISN 120 
__, EGASRIFIEL KFTLRDCNSL PGGLGTCKET FNMYYFESDD QNGRNIKENQ YIKIDTIAAD 180 
35 ESFTELDLGD RVMKLNTEVR DVGPLSKKGF YLAFQDVGAC IALVSVRVYY KKCPSVVRHL 240 
AVFPDTITGA DSSQLLEVSG SCVNHSVTDE PPKMHCSAEG EWLVPIGKCM CKAGYEEKNG 300 
TCQVCRPGFF KASPfflQSCG KCPPHSYTHE EASTSCVCEK DYFRRESDPP TMACTRPPS A 360 
PRNAISNVNE TSVFLEW1PP ADTGGRKDVS YYIACKKCNS HAGVCEECGG HVRYLPRQSG 420 
LKNTSVMM VD LLAHTNYTFE 1EAVNG VSDL SPG ARQYVSV NVTTNQAAPS PVTNVKKGKI 480 
40 AKNSISLSWQ EPDRPNGIH. EYEIKHFEKD QETSYTOKS KETTITAEGL KPAS VYVFQI 540 

RARTAAGYGV FSRRFEFETT PVFAASSDQS QIPVIAVS VT VGVILLAWI GVLLSGSCCE 600 
CGCGRASSLC AVAHPIL1WR CGYSKAKQDP EEEKMHFHNG HIKLPGVRTY IDPHTYEDPN 660 
QAVHEFAKEI EASCIT1ERV IGAGEFGEVC SGRLKLPGKR ELPVAIKTLK VGYTEKQRRD 720 
A _ FLGEASIMGQ FDHPNIMLE GVVTKSKPVM IVTEYMENGS LDTFLKKNDG QFTVIQLVGM 780 
45 LRGISAGMKY LSDMGYVHRD LAARN1LINS NLVCKVSDFG LSRVLEDDPE AAYTTRGGKI 840 
PIRWTAPEAI AFRKFTS ASD VWSYGIVMWE VVS YGERPYW EMTNQDVIKA VEEGYRLPSP 900 
MDCPAALYQL MLDCWQKERN SRPKFDE1VN MLDKL1RNPS SLKTLVNASC RVSNLLAEHS 960 
PLGSGAYRSV GEWLEAIKMG RYTEIFMENG YSSMDAVAQV TLEDLRRLGV TLVGHQKKIM 1020 
NSLQEMKVQL VNGMVPL 



__, SEQ ID NO:139 PFH2 DHA SEQUENCE 

55 Nuclao Acid Accession* NM_O16029 

Coding sequence: 78-1 097 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

60 CTGCGATCCC GCAGGGCAGC G ACGCGACTC TGGTGCGGGC CGTCTTCTTC CCCCGGAGCT 60 
GGGCGTGCGC GGCCGCAATG AACTGGGAGC TGCTGCTGTG GCTGCTGGTG CTGTGCGCGC 120 
TGCTCCTGCT CTTGGTGC AG CTGCTGCGCT TCCTGAGGGC TGACGGCGAC CTGACGCTAC 1 80 
TATGGGCCG A GTGGCAGGGA CGACGCCC AG AATGGGAGCT GACTG ATATG GTGGTGTGGG 240 
TGACTGGAGC CTCGAGTGGA ATTGGTGAGG AGCTGGCTTA CCAGTTGTCT AAACTAGGAG 300 
65 TTTCTCTTGT GCTGTCAGCC AGAAGAGTGC ATGAGCTGGA AAGGGTGAAA AGAAGATGCC 360 
TAGAGAATGG CAATTTAAAA GAAAAAGATA TACTTGTTTT GCCCCTTGAC CTGACCGACA 420 
CTGGTTCCCA TGAAGCGGCT ACCAAAGCTG TTCTCCAGGA GTTTGGTAGA ATCGACATTC 480 
TGGTCAACAA TGGTGGAATG TCCCAGCGTT CTCTGTGCAT GGATACCAGC TTGGATGTCT 540 
_„ ACAGAAAGCT AATAGAGCTT AACTACTTAG GGACGGTGTC CTTGACAAAA TGTGTTCTGC 600 
70 CTCACATG AT CG AGAGGAAG CAAGGAAAG A TTGTTACTGT G AATAGCATC CTGGGTATCA 660 
TATCTGTACC TCTTTCCATT GGATACTGTG CTAGCAAGCA TGCTCTCCGG GGTTTnTTA 720 
ATGGCCTTCG AACAGAACTT GCCACATACC CAGGTATAAT AGTTTCTAAC ATTTGCCCAG 780 
GACCTGTGCA ATCAAATATT GTG GAG A ATT CCCTAGCTGG AGAAGTCACA AAGACTATAG 840 
_ , GCAATAATGG AG ACC AGTCC C ACAAGATGA CAACCAGTCG TTGTGTGCGG CTG ATGTTAA 900 
75 TCAGCATGGC CAATGATTTG AAAGAAGTTT GGATCTCAGA ACAACCTTTC TTGTTAGTAA 960 
CATATTTGTG GCAATACATG CCAACCTGGG CCTGGTGGAT AACCAACAAG ATGGGGAAGA 1020 
AAAGGATTGA GAACTTTAAG AGTGGTGTGG ATGC AG ACTC TTCTT ATTTT AAAATCTTTA 1080 
AGACAAAACA TGACIGAAAA GAGCACCTGT ACTTTTCA AG CCACTGGAGG GAGAAATGGA 1 140 
AAACATG AAA ACAGC AATCT TCTTATGCTT CTGAATA ATC AAAGACTAAT TTGTGATTTT 1200 
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ACTTTTTAAT AGATATGACT TTGCTTCCAA CATGGAATGA AATAAAAAAT AAATAATAAA 1260 
AGATTGCCAT GAATCTTGCA AA 



I I I I I I 

MNWELLLWLL VLCALLLLLV QLLRFLRADG DLTLLWAEWQ GRRPEWELTD MVVWVTGASS 6C 
GIGEELAYQL SKLGVSLVLS ARRVHELERV KRRCLENGNL KEKDILVLPL DLTDTGSHEA 120 
ATKAVLQEFG RDD1LVNNGG MSQRSLCMDT SLDVYRKLIE LNYLGTVSLT KCVLPHMffiR 180 
KQGKTVTVNS ILGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGPVQSN 240 
IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY 300 
MPTWAWWITN KMGKKRIENF KSGVDADSSY FK1FKTKHD 



SEQID N0:141 PFH1 DNA SEQUENCE 

Nucleic Add Accession* NM.021614 

Coding sequence: 1-1740 (underlined sequences correspond lo start and stopcodons) 

1 11 21 31 41 51 
I I I I I I 

ATQAGCAGCT GCAGGTACAA CGGGGGCGTC ATGCGGCCGC TCAGCAACTT GAGCGCGTCC 60 
CGCCGGAACC TGCACGAGAT GGACTCAGAG GCGCAGCCCC TGCAGCCCCC CGCGTCTGTC 120 
GGAGGAGGTG GCGGCGCGTC CTCCCCGTCT GCAGCCGCTG CCGCCGCCGC CGCTGTTTCG 1 80 
TCCTCAGCCC CCGAGATCGT GGTGTCTAAG CCCGAGCACA ACAACTCCAA CAACCTGGCG 240 
CTCTATGGAA CCGGCGGCGG AGGCAGCACT GGAGGAGGCG GCGGCGGTGG CGGGAGCGGG 300 
CACGGCAGCA GCAGTGGCAC CAAGTCCAGC AAAAAGAAAA ACCAGAACAT CGGCTACAAG 360 
CTGGGCCACC GGCGCGCCCT GTTCGAAAAG CGCAAGCGGC TCAGCGACTA CGCGCTCATC 420 
TTCGGCATGT TCGGCATCGT GGTCATGGTC ATCGAGACCG AGCTGTCGTG GGGCGCCTAC 480 
GACAAGGCGT CGCTGTATTC CTTAGCTCTG AAATGCCTTA TCAGTCTCTC CACGATCATC 540 
CTGCTCGGTC TGATCATCGT GTACCACGCC AGGGAAATAC AGTTG1TCAT GGTGGACAAT 600 
GGAGC AGATG ACTGGAGAAT AGCCATGACT TATGAGCGTA TTTTCTTCAT CTGCTTGGAA 660 
ATACTGGTGT GTGCTATTCA TCCCATACCT GGGAATTATA CATTCACATG GACGGCCCGG 720 
CTTGCCTTCT CCTATGCCCC ATCCACAACC ACCGCTGATG TGGATATTAT TTTATCTATA 780 
CCAATGTTCT TAAGACTCTA TCTGATTGCC AGAGTCATGC TTTTACATAG CAAACTTTTC 840 
ACTGATGCCT CCTCTAGAAG CATTGGAGCA CTTAATAAGA TAAACTTCAA TACACGTTTT 900 
GTTATGAAGA CTTTAATGAC TATATGCCCA GGAACTGTAC TCTTGGTTTT TAGTATCTCA 960 
TTATGGATAA TTGCCGCATG GACTGTCCGA GCTTGTGAAA GGTACCATGA TCAACAGGAT 1020 
GTTACTAGCA ACTTCCTTGG AGCGATGTGG TTGATATCAA TAACTT1TCT CTCCATTGGT 1080 
TATGGTGACA TGGTACCTAA CACATACTGT GGAAAAGGAG TCTGCTTACT TACTGGAATT 1 140 
ATGGGTGCTG GTTGCACAGC CCTGGTGGTA GCTGTAGTGG CAAGGAAGCT AGAACTTACC 1200 
AAAGCAGAAA AACACGTGCA CAATCTCATG ATGGATACTC AGCTGACTAA AAGAGTAAAA 1260 
AATGCAGCTG CCAATGTACT CAGGGAAACA TGGCTAATTT ACAAAAATAC AAAGCTAGTG 1320 
AAAAAGATAG ATCATGCAAA AGTAAGAAAA CATCAACGAA AATTCCTGCA AGCTATTCAT 1380 
CAATTAAGAA GTGTAAAAAT GGAGCAGAGG AAACTGAATG ACCAAGCAAA CACTTTGGTG 1440 
GACTTGGCAA AGACCCAGAA CATCATGTAT GATATGATCT CTGACTTAAA CGAAAGGAGT 1500 
GAAGACTTCG AGAAGAGGAT TGTTACCCTG GAAACAAAAC TAGAGACTTT GATTGGTAGC 1560 
ATCCACGCCC TCCCTGGGCT CATAAGCCAG ACCATCAGGC AGCAGCAGAG AGATTTCATT 1620 
GAGGCTCAGA TGGAGAGCTA CGACAAGCAC GTCACTTACA ATGCTGAGCG GTCCCGGTCC 1680 
TCGTCCAGGA GGCGGCGGTC CTCTTCCACA GCACCACCAA CTTCATCAGA GAGTAGCTAG 



Protein Accession*: NP.067627 

1 11 21 31 41 51 

MSSCRYNGGV MRPLSNLSAS RRNLHEMDSE AQPLQPPASV GGGGGASSPS AAAAAAAAVS 60 
SSAPEIWSK PEHNNSNNLA LYGTGGGGST GGGGGGGGSG HGSSSGTKSS KKKNQNIGYK 120 
LGHRRALFEK RKRLSDYALI FGMFGIWMV ETELSWGAY DKASLYSLAL KCL1SLST1I 180 
LLGL11VYHA REIQLFMVDN GADDWRIAMT YERIFFICLE ILVCA1HPIP GNYTFTWTAR 240 
LAFSYAPSTT TADVDIILSI PMFLRLYLIA RVMLLHSKLF TDASSRS1GA LNKINFNTRF 300 
VMKTLMTICP GTVLLVFSIS LW1IAAWTVR ACERYHDQQD VTSNFLGAMW US1TFLS1G 360 
YGDMVPNTYC GKGVCLLTGI MGAGCTALVV AV VARKLELT KAEKHVHNFM MDTQLTKRVK 420 
NAAAN VLRET WUYKNTKLV KKIDHAKVRK HQRKFLQAIH QLRS VKMEQR KLNDQANTLV 480 
DLAKTQNIMY DMISDLNERS EDFEKRIVTL ETKLETLIGS IHALPGLISQ TIRQQQRDFI 540 
EAQMESYDKH VTYNAERSRS SSRRRRSSST APPTSSESS 



SEQ ID N0:143 PFG9 DNASEQUENCE 
: AL1 10139, cocino region Is FGENESH predicted 
1-1 896 (underlined sequences correspond lo start and stop codons) 
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ATQCGCGCCG TGCCGCTGCC CGCCCCGCTC CTGCCGCTGC TGCTGCTCGC GCTCCTGGCC 60 
GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG CGCCGTGGCC CGAACCCGAG 120 
CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 
5 GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 
ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 
TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGG G AAAG AGGTTA 360 
AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGGA AATGGCGCCA 420 
CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

10 TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 
GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 
GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 
TCAGCTGC AA CCTGGGGTGT GAAGGACTGG ACCTGG AAGC CCTCTTGCGT CGG AGGTGTT 720 
GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCTCCAT CGTGCGTGTC AGGCATCTGC 780 

15 TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840 
TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 
CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960 
GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTTCCAAAT 1020 
CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC ACAAACCATG 1080 

20 AGTACCTTGG GCTTGG ATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT TTGTG AAGAC 1140 
AQAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 1200 
GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 1260 
ACCCATCCTG TCAGGTTGGC TCGTTCAGAT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 1320 
AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGGCGGAG GGTCAGCGGG TACCGCAACT 1380 

25 TGCCTTITGG TTTTGAAG AT TCTGTTGAGG CGCCATCCTC ACCTTGACCT CTTCTACAAA 1440 

ATCTGTCTCC CCTGCTGTGC CGTGGAACAC CTACGGGAAG CCAAGAGAAG CTCAGTGACT 1500 
GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560 
AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGG GATCACACAT 1620 

„ GCGAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GGATGTCACT 1680 

30 CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740 
GATGGCAGAT GCCAGAAGAT GGTCCTGATG TCTGAGGAAG GGCCACCTAG TTTGACAGGA 1800 
TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT I860 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG CCC7GA 



SEO ID N0.144 PFG9 Prolein sequence: 

Protein Accession #: none available, FGENESH predicted 



MRAVPLPAPL LPLLLLALLA APAARASRAE SVSAPWPEPE RESRPPPGPG PGNTTRFGSG 60 
AAGGSGSSSS NSSGDALVTR ISIIXRDLPT LKAAVIVAFA FTTLLIACLL LRVFRSGKRL 120 
KKTRKYDIIT TPAERVEMAP LNEEDDEDED STVFD1KYRV SLPAALRRQL PGCQTLLTVP 180 
M _ VPPPFILDID LPARCSGRPD GGIRPGKTCF PAWWHFVESW SAATWGVKDW TWKPSCVGGV 240 
45 ETKTNVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPFKVPPT STPHGFRQLQ 300 
LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFS ATGS PIKTLYTQTM 360 
STLGLDVFCG AGQRGTFCED RAVTKVLQGS SESKQLRWKP ALESGFPHHL RLLRECPPLS 420 
THPVRLARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR RHPHLDLFYK 480 
ICLPCCAVEH LREAKRSSVT VLASFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 540 
50 ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRCQKMVLM SEEGPPSLTG 600 
CERLTGSHHF SSHSKSWSFL SPRQPLFLSR P 



SEd ID N0:145PFG6DNA SEQUENCE 

55 Nucleic Acid Accession*: NH.013427 

Coding sequence: 875-3799 (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 
,„ I I I I I I 

60 GGCTGGGCTG CGAATAGCGT GTTCCTCTCC GGCGGAACAC ACACACCCGG CCTTGGGGCT 60 
GTCTCCTGAA GCTCCCTCCT CCACGGAGAG CGCTGAGCGC CGCCGGGAAT TCCATCCCAC 120 
CGTGGGCACG CAGTCTTTGG AGGTCCCGGG CGCAGCACGC TCGGTGTCCC CACACTGCAG 180 
CAAGACAGAG ACCCCGCGGG AACCTTGAGC TTGGAACAAC CCTTGAGCCT CTGCAGTCGG 240 
AAGAGTGGGC GCAGCAGCCC AGCGGAGGCC AGGCGCGCAA CCTCGGGCGC CGGGGCAAGG 300 

65 AGAGAGTGCA GGGAGGCGCA GCTCAGGCGC CCGGCTCAGG AGCGGGAGGA AGTTCTCGCG 360 
GCGCCGGGAG CGCGGTGGAC GCGCCCTGGG CGCACGCCCA GGCAGCCTTC TCCCTGGCCC 420 
TCGGGACTGT CCTCGGGCGG CAAGGAGGAG CTTGCTGGAG TCTTAGAGGC CATCCAGAGC 480 
CAGCGAGCAG GAGCGCTGCG TCTCCCGCCT CAGCTAGGAA GGGGGAGTGG CGCTGGCAGG 540 
CTGGAGCTGG GAACCCAGCG AGCGCCTGAC CTTCCTCCTC CTCTTCCTGA CCCTCTTCGC 600 

70 GTCTTGGGCT CCGGAGGAAG GTTCTAGCGG CTGCAGGAGG TCCCCAGACC CATTTTCCTA 660 
GAAGGCTGGT GATGGATCTG CTGCTCCTGC CGCCGCCGGG GCACTTGGAG CGCACCGGCG 720 
GCGCGTGAGC TGGGCTTTGC TCTCCACCGC CCTGGGCAAA CCCCGGGCCA GCCCCGCCTG 780 
GCACCTTTGC CTG AGTCCCT TTCGGTTCCC GACCCAAAGC CACCAGCGTC CAGGGAGGGA 840 
GGAGGAGGTGGTCCTCAGGTGCAGCCCCGCCGAGATQTCCGCGCAGAGCCTGCTCCACAG 900 

75 CGTCTTCTCC TGTTCCTCGC CCGCTTCAAG TAGCGCGGCC TCGGCCAAGG GCTTCTCCAA 960 

GAGGAAGCTGCGCCAGACCCGCAGCCTGGACCCGGCCCTG ATCGGCGGCT GCGGG AGCGA 1020 
CGAGGCGGGC GCGGAGGGCA GTGCGCGGGG AGCCACGGCG GGCCGCCTCT ACTCCCCATC 1080 
ACTCCCAGCC G AG AGTCTCG GCCCTCGCTT GGCGTCCTCT TCCCGGGGTC CGCCCCCCAG 1140 
GGCCACCAGG CTACCGCCTC CTGGACCTCT TTGCTCGTCC TTCTCCACAC CCAGCACCCC 1200 
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CCAGOAGAAG TCACCATCCG GCAGCTTTCA CTTTGACTAT GAGGTTCCCC TGGGTCGCGG 1260 
CGGCCTCAAG AAGAGCATGG CCTGGGACCT GCCTTCTGTC CTGGCCGGGC CAGCCAGTAG 1320 
CCGAAGCGCT TCCAGCATCC TCTGTTCATC CGGGGGAGGC CCCAATGGCA TCTTCGCTTC 1380 
TCCTAGGAGG TGGCTCCAGC AGAGGAAGTT CCAGTCCCCA CCCGACAGTC GCGGGCACCC 1440 
CTACGTCGTG TGGAAATCCG AGGGTGATTT CACCTGGAAC AGCATGTCAG GCCGCAGTGT 1500 
GCGGCTGAGG TCAGTCCCCA TCCAGAGTCT CTCAGAGCTG GAGAGGGCCC GGCTGCAGGA 1560 
AGTGCCTTTr TATCAGTTGC AACAGGACTG TGACCTGAGC TGTCAGATCA CCATTCCCAA 1620 
AGATGGACAA AAGAGAAAGA AATCTTTAAG AAAGAAACTG GATTCACTAG GAAAGGAGAA 1680 
AAACAAAGAC AAAGAATTCA TCCCACAGGC ATTTGGAATG CCCTTATCCC AAGTCATTGC 1740 
GAATGACAGG GCCTATAAAC TCAAGCAGGA CTTGC AGAGG GACGAGCAG A AAGATGC ATC 1 800 
TGACTTTGTG GCTTCCCTCC TCCCATTTGG AAATAAAAGA CAAAACAAAG AACTCTCAAG 1860 
CAGTAACTCA TCTCTCAGCT CAACCTCAGA AACACCGAAT GAGTCAACGT CCCCAAACAC 1920 
CCCGGAACCG GCTCCTCGGG CTAGGAGGAG GGGTGCCATG TCAGTGGATT CTATCACCGA 1980 
TCTTGATGAC AATCAGTCTC GACTACTAGA AGCTTTACAA CTTTCCTTGC CTGCTGAGGC 2040 
TCAAAGTAAA AAGGAAAAAG CCAGAGATAA GAAACTCAGT CTGAATCCTA TTTACAGACA 2100 
GGTCCCTAGG CTGGTGGACA GCTGCTGTCA GCACCTAGAA AAACATGGCC TCCAGACAGT 2160 
GGGGATATTC CGAGTTGGAA GCTCAAAAAA GAGAGTGAGA CAATTACGTG AGGAATTTGA 2220 
CCGTGGGATT GATGTCTCTC TGGAGGAGGA GCACAGTGTT CATGATGTGG CAGCCTTGCT 2280 
GAAAGAGTTC CTGAGGGACA TGCCAGACCC CCTTCTCACC AGGGAGCTGT ACACAGCTTT 2340 
CATCAACACT CTCTTGTTGG AGCCGGAGGA ACAGCTGGGC ACCTTGCAGC TCCTCATATA 2400 
CCTTCTACCT CCCTGCAACT GCGACACCCT CCACCGCCTG CTACAGTTCC TCTCCATCGT 2460 
GGCCAGGCAT GCCG ATGACA ACATCAGCAA AGATGGGCAA GAGGTCACTG GGAATAAAAT 2520 
GACATCTCTA AACTTAGCCA CCATATTTGG ACCCAACCTG CTGCACAAGC AGAAGTCATC 2580 
AGACAAAGAA TTCTCAGTTC AGAGTTCAGC CCGGGCTGAG GAGAGCACGG CCATCATCGC 2640 
TGTTGTGCAA AAGATGATTG AAAATTATGA AGCCCTGTTC ATGGTTCCCC CAGATCTCCA 2700 
GAACGAAGTG CTGATCAGCC TGTTAGAGAC CGATCCTGAT GTCGTGGACT ATTTACTCAG 2760 
AAGAAAGGCT TCCCAATCAT C A AGCCCTG A CATGCTGCAG TCGGAAGTTT CCTTTTCCGT 2820 
GGGAGGGAGG CATTCATCTA CAGACTCCAA CAAGGCCTCC AGCGGAGACA TCTCCCCTTA 2880 
TGACAACAAC TCCCCAGTGC TGTCTGAGCG CTCCCTGCTG GCTATGCAAG AGGACGCGGC 2940 
CCCGGGGGGC TCGGAGAAGC TTTACAGAGT GCCAGGGCAG TTTATGCTGG TGGGCCACTT 3000 
GTCGTCGTCA AAGTCAAGGG AAAGTTCTCC TGG ACCAAGG CTTGGGAAAG ATCTGTCAGA 3060 
GGAGCCTTTC GATATCTGGG GAACTTGGCA TTCAACATTA AAAAGCGGAT CCAAAGACCC 3120 
AGGAATGACA GGTTCCTCTG GAG ACATTTT TGAAAGCAGC TCCCTAAGAG CGGGGCCCTG 3 1 80 
CTCCCTTTCT CAAGGGAACC TGTCCCCAAA TTGGCCTCGG TGGCAGGGGA GCCCCGCAGA 3240 
GCTGGACAGC GACACGCAGG GGGCTCGGAG GACTCAGGCC GCAGCCCCCG CGACGGAGGG 3300 
CAGGGCCCAC CCTGCGGTGT CGCGCGCCTG CAGCACGCCC CACGTCCAGG TGGCAGGGAA 3360 
AGCCGAGCGG CCCACGGCCA GGTCGGAGCA GTACTTGACC CTGAGCGGCG CCCACGACCT 3420 
CAGCG AGAGT GAGCTGGATG TGGCCGGGCT GCAGAGCCGG GCCACACCTC AGTGCCAAAG 3480 
ACCCCATGGG AGTGGGAGGG ATGACAAGCG GCCCCCGCCT CCATACCCGG GCCCAGGGAA 3S40 
GCCCGCGGCA GCGGCAGCCT GGATCCAGGG GCCCCCGGAA GGCGTGGAGA CACCCACGGA 3600 
CCAGGGAGGC CAAGCAGCCG AGCGAGAGCA GCAGGTCACG CAGAAAAAAC TGAGCAGCGC 3660 
CAACTCCCTG CCAGCGGGCG AGCAGGACAG TCCGCGCCTG GGGGACGCTG GCTGGCTCGA 3720 
CTGGCAGAGA GAGCGCTGGC AGATCTGGGA GCTCCTGTCG ACCGACAACC CCGATGCCCT 3780 
GCCCGAGACG CTGGTCTGAG CCCGCACCCA GCCGAGCCCC CCCTGCCCCG AGCCCCCCGC 3840 
CCTCCAGCCC AGGGGGGACC GTGGGTGGTG GCCACTGGCA CACTTAQTGT TCTTCTTTCA 3900 
CACTTCTCAA AAGTG ACACA AGAGAAATCC AGTTCACCTA CAGAGGTAGA GCACTCACGC 3960 
CCCCGCCATT GAGAATAAGG TTCCATTGCG TAGCCAGCCT TAGGAAAAAC AAACAGAACC 4020 
CAAACCAG AT GGCAATGTCC AATCTAA AAA CGTCCCTCTT GGCTCTATAA TATAAGATAC 4080 
AACTCTTGCT TGGTATAGCC TAACCGTATT TATGTGTCTT CGGTTrTGAC TATTGTGTAT 4140 
TCTGTAACAG ATTATGTATA ATCATATATG ATATATTCAC AAAGAGAAAA CAAAAGGAAC 4200 
TTTTAAAAAA AAAATCACTT CACTTATATT AAGCAATGAG ATATACTAAA CAATGAGATT 4260 
CTATAGAATG TTCTAGAATG TGCACAAGCG GGTTTCTGTG CmTGCCAT AGCTTTATAA 4320 
CTGGGGATAA CCCTTCCTTC GATACCAAAC ACTAACAAGA GGAAGCAGAA TATGAGAAGC 4380 
CATATTTTTA CATAGGAGTC AGATACAAAA AGAAAAATCA CTGAATGCTT TTAGATATTG 4440 
AATACGTTTT CAGGAAAATG CTAAATCTGA TAG ATTACGA AATATATTTT TAGAACTTGT 4500 
TTAGAAAGGA TTCAGTTAAC CAAACAAGAA AAAGGCAGTG CCTCACAAAG AAATTAAGAA 4560 
GTTGTCCGTC CCACGTTACA TCAAATTCAG TTTTATATAG GCCATATATA ATATATATTT 4620 
ATAATGTATA ATTTTTATGT ATTTTTCAAA ACTACA AACT GG AATCCAAC TATAAAGTGT 4680 
TTAAGAATCT ACACAGAATA TTCAAATTAT AG AACATGTT TTTTCCCTTT GCCCCATAAT 4740 
CAGTATTTGC CAAATTACAT GCAATTCCTT AAAAACTAAA TCACATTGGT AAAAQGCCTA 4800 
CAGCTTTGTA CTTACATTGT GCCAAAGGCT GAGGAAATGT TTTCTTTCGA ATTTTTATGT 4860 
GTATTGTAAA ATGTTCTACC GTACTTTAGT AGTTTG AAGT TTTCAAGTGC ATAACTATTT 4920 
TTGACCAGCA GAAGGCGATA CGCTTCAGTA TTTTATGCAA TTTTTTTTCA CTTCGAAGGG 4980 
AAAGTGTATT ATAAAAA AAG ATTTTTTTTT TTTAAAACAT GCTACTCTTA ATTTTCATGT 5040 
TGGTGATGAA ATTCCCAGTG GTGTTTCTTA AGGTTCTATC TTGTGCCATG ATGAATAAAA 5100 
AGTTAAGCAA AAAAAAAAAA AAAAAAAAAA AAA 

SEQ ID HO:146 PF G6 Prolan seniiRnce: 
Protein Accession*: NPJJ38286.1 

1 II 21 31 41 51 

MSAQSLLHSV FSCSSPASSS AASAKGFSKR KLRQTRSLDP ALIGGCGSDE AGAEGSARGA 60 
TAGRLYSPSL PAESLGPRLA SSSRGPPPRA TRLPPPGPLC SSFSTPSTPQ EKSPSGSFHF 120 
DYEVPLGRGG LKKSMAWDLP SVLAGPASSR S ASSILCSSG GGPNGIFASP RRWLQQRKFQ 180 
SPPDSRGHPY VVWKSEGDFT WNSMSGRS VR LRSVFIQSLS ELERARLQEV PFYQLQQDCD 240 
LSCQITIPKD GQKRKKSLRK KLDSLGKEKN KDKEFIPQAF GMPLSQVIAN DRAYKLKQDL 300 
QRDEQKDASD FVASLLPFGN KRQNKELSSS NSSLSSTSET PNESTSPNTP EPAPRARRRG 360 
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AMS VDSITDL DDNQSRLLEA LQLSLPAEAQ SKKF.KARDKK LSLNPIYRQV PRLVDSCCQH 420 
LEKHGLQTVG IFRVGSSKKR VRQLREEFDR GIDVSLEEEH SVHDVAALLK EFLRDMPDPL 480 
LTRELYTAFI NTLLLEPEEQ LGTLQLLIYL LPPCNCDTLH RLLQFLStVA RHADDNISKD 540 
GQEVTGNKMT SLNLATIFGP M.LLHKQKSSD KEFSVQSSAR AEESTAUAV VQKMIENYEA 600 
5 LFMVPPDLQN EVLISLLETD PDVVDYLLRR KASQSSSPDM LQSEVSFSVG GRHSSTDSNK 660 
ASSGDISPYD MNSPVLSERS LLAMQEDAAP GGSEKLYRVP GQFMLVGHLS SSKSRESSPG 720 
PRLGKDLSEE PFDIWGTWHS TLKSGSKDPG MTGSSGDIFE SSSLRAGPCS LSQGNF.SPNW 780 
PRWQGSPAEL DSDTQGARRT QAA APATEGR AHPAVSR ACS TPHVQVAGKA ERPTARSEQY 840 
LTLSGAHDLS ESELDVAGLQ SRATPQCQRP HGSGRDDKRP PPPYPGPGKP AAAAAWIQGP 900 
10 PEGVETPTDQ GGQAAEREQQ VTQKKLSSAN SLPAGEQDSP RLGDAGWLDW QRERWQIWEL 960 
LSTDNPDALP ETLV 



15 SEQ ID NO:147PFG4DNA SEQUENCE 

Nucleic Acid Accession*: NMJJ02202 

Coding sequence: 240-1289 (underilned sequences correspond toslart and siopcodons) 



20 1 11 21 31 41 51 
I I I I I I 

CCCCCGAGCC GCGCCGAGTC TGCCGCCGCC GCAGCGCCTC CGCTCCGCCA ACTCCGCCGG 60 
CTTAAATTGG ACTCCTAGAT CCGCGAGGGC GCGGCGCAGC CGAGCAGCGG CTCTTTCAGC 120 
ATTGGCAACC CCAGGGGCCA ATATTTCCCA CTTAGCCACA GCTCCAGCAT CCTCTCTGTG 180 

25 GGCTGTTCAC CAACTGTACA ACCACCATTT CACTGTGGAC ATTACTCCCT CTTACAGATA 240 
TG.GGAGACAT GGGAGATCCA CCAAAAAAAA AACGTCTGAT TTCCCTATGT GTTGGTTGCG 300 
GCAATCAGAT TCACGATCAG TATATTCTGA GGGTTTCTCC GGATTTGGAA TGGCATGCGG 360 
CATGTTTGAA ATGTGCGGAG TGTAATCAGT ATTTGGACGA GAGCTGTACA TGCTTTGTTA 420 
GGGATGGGAA AACCTACTGT AAAAGAGATT ATATCAGGTT GTACGGGATC AAATGCGCCA 480 

30 AGTGCAGCAT CGGCTTCAGC AAGAACGACT TCGTGATGCG TGCCCGCTCC AAGGTGTATC 540 
ACATCGAGTG TTTCCGCTGT GTGGCCTGCA GCCGCCAGCT CATCCCTGGG GACGAATTTG 600 
CGCTTCGGGA GGACGGTCTC TTCTGCCGAG CAGACCACGA TGTGGTGGAG AGGGCCAGTC 660 
TAGGCGCTGG CGACCCGCTC AGTCCCCTGC ATCCAGCGCG GCCACTGCAA ATGGCAGCGG 720 
AGCCCATCTCCGCCAGGCAGCCAGCCCTGCGGCCCCACGTCCACAAGCAGCCGGAGAAGA 780 

35 CCACCCGCGT GCGGACTGTG CTGAACGAGA AGCAGCTGCA CACCTTGCGG ACCTGCTACG 840 
CCGCAAACCC GCGGCCAGAT GCGCTCATG A AGG AGC AACT GGTAGAGATG ACGGGCCTCA 900 
GTCCCCGTGT GATCCGGGTC TGGTTTCAAA ACAAGCGGTG CAAGGACAAG AAGCGAAGCA 960 
TCATG ATCAA GCAACTCCAG CAGCAGCAGC CCAATGACAA AACTAATATC CAGGGGATGA 1020 
. n CAGGAACTCC CATGGTGGCT GCCAGTCCAG AGAGACACGA CGGTGGCTTA CAGGCTAACC 1080 

40 CAGTGGAAGT ACAAAGTTAC CAGCCACCTT GGAAAGTACT GAGCGACTTC GCCTTGCAGA 1 140 
GTGACATAGA TCACCCTGCT TTTCAGCAAC TGGTCAATTT TTCAGAAGGA GGACCGGGCT 1200 
CTAATTCCAC TGGCAGTGAA GTAGCATCAA TGTCCTCTCA ACTTCCAGAT ACACCTAACA 1260 
GCATGGTAGC CAGTCCTATT GAGGCATGAG GAACATTCAT TCTGTATTTT TTTTCCCTGT 1320 
TGGAGAAAGT GGGAAATTAT AATGTCGAAC TCTGAAACAA AAGTATTTAA CGACCCAGTC 1380 

45 AATGAAAACT GAATCAAGAA ATGAATGCTC CATGAAATGC ACGAAGTCTG TTTTAATGAC 1440 
AAGGTGATAT GGTAGCAACA CTGTGAAGAC AATCATGGGA TTTTACTAGA ATTAAACAAC 1500 
AAACAAAACG CAAAACCCAG TATATGCTAT TCAATGATCT TAGAAGTACT GAAAAAAAAA 1560 
GACGTTTTTA AAACGTAGAG GATTTATATT CAAGGATCTC AAAGAAAGCA TTTTCATTTC 1620 

„ ACTGCACATC TAGAGAAAAA C AAAAATAGA A AATTTTCTA GTCCATCCTA ATCTGAATGG 1680 

50 TGCTGTTTCT ATATTGGTCA TTGCCTTGCC A A AC AGG AGC TCCAGCAAAA GCGCAGGAAG 1740 
AGAGACTGGC CTCCTTGGCT GAAAGAGTCC TTTCAGGAAG GTGGAGCTGC ATTGGTTTGA 1800 
TATGTTTAAA GTTGACTTTA ACAAGGGGTT AATTGAAATC CTGGGTCTCT TGGCCTGTCC 1860 
TGTAGCTGGT TTAimTTA CTTTGCCCCC TCCCCACTTT TTTTG AGATC CATCCTTTAT 1920 
CAAGAAGTCT GA AGCGACTA TAAAGGTTTT TG AATTCAGA TTTAAAAACC AACTTATAAA 1 980 

55 GCATTGCAAC AAGGTTACCT CTATTTTGCC ACAAGCGTCT CGGGATTGTG TTTGACTTGT 2040 
GTCTGTCCAA GAACTTTTCC CCCAAAGATG TGTATAGTTA TTGGTTAAAA TGACTGTTTT 2100 
CTCTCTCTAT GGAAATAAAA AGG AAAAAAA AAAGGAAACT TnTTTGTrr GCTCTTGCAT 2160 
TGCAAAAATT ATAAAGTAAT TTATTATTTA TTGTCGGAAG ACTTGCCACT TTTCATGTCA 2220 
,^ TTTGACATTT TTTGTTTGCT GAAGTGAAAA AAAAAGATAA AGGTTGTACG GTGGTCTTTG 2280 

60 AATTATATGT CTAATTCTAT GTGTTTTGTC TTTTTCTTAA ATATTATGTG AAATCAAAGC 2340 
GCCATATGTA GA ATTATATC TTCAGGACTA TTTCACTAAT AAACATTTGG CATAGAT 



65 §E q ID NQ:148PFS4 Prote in sequence ; 
Protein Accession #: NP_002193.1 

1 II 21 31 41 51 

70 MGDPPKKKRL ISLCVGCGNQ IHDQYILRVS PDLEWHAACL KCAECNQYLD ESCTCFVRDG 60 
KTYCKRDYIR LYGIKCAKCS 1GFSKNDFVM RARSKVYHD3 CFRCVACSRQ LIPGDEFALR 120 
EDGLFCRADH DVVERASLGA GDPLSPLHPA RPLQMAAEPI SARQPALRPH VHKQPEKTTR 180 
VRTVLNEKQL HTLRTCYAAN PRPDALMKEQ LVEMTGLSPR VIRVWFQNKR CKDKKRS1MM 240 
KQLQQQQPND KTNIQGMTGT PMVAASPERH EXjGLQANPVE VQSYQPPWKV LSDFAIJJSDl 300 

75 DQPAFQQLVN FSEGGPGSNS TGSEVASMSS QLPDTPNSMV ASPD3A 
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SEQ ID NO: 1 49 PFQ2 DNA SEQUENCE 

Nucleic Acid Accession*: NM_001172 

Coding sequence: 39-1 1 03 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GCGGAGCTCT GCCTTGGAGA TTCTCAGTGC TGCGG ATCATfiTCCCTAAGG GGCAGCCTCT 60 
CGCGTCTCCT CCAG ACGCGA GTGCATTCCA TCCTGA AG AA ATCCGTCCAC TCCGTGGCTG 1 20 
TGATAGGAGC CCCGTTCTCA CAAGGGCAGA AAAGAAAAGG AGTGGAGCAT GGTCCCGCTG 180 
CCATAAGAGA AGCTGGCTTG ATGAAAAGGC TCTCCAGTTT GGGCTGCCAC CTAAAAGACT 240 
TTGGAGATTT GAGTTTTACT CCAGTCCCCA AAGATGATCT CTACAACAAC CTGATAGTGA 300 
ATCCACGCTC AGTGGGTCTT GCCAACCAGG AACTGGCTGA GGTGGTTAGC AGAGCTGTGT 360 
CAGATGGCTA CAGCTGTGTC ACACTGGGAG GAGACCACAG CCTGGCAATC GGTACCATTA 420 
GTGGCCATGC CCGACACTGC CCAGACCTTT GTGTTGTCTG GGTTGATGCC CATGCTGACA 480 
TCAACAC ACC CCTTACCACT TCATCAGGAA ATCTCC ATGG AC AGCCAGTT TCATTTCTCC 540 
TCAGAGAACT ACAGGATAAG GTACCACAAC TCCCAGGATT TTCCTGGATC AAACCTTGTA 600 
TCTCTTCTGC AAGTATTGTG TATATTGGTC TGAGAGACGT GGACCCTCCT GAACATTTTA 660 
TTTTAAAGAA CTATGATATC CAGTATTTTT CCATGAGAGA TATTGATCGA CTTGGTATCC 720 
AGAAGGTCAT GGAACGAACA TTTGATCTGC TGATTGGCAA GAGACAAAGA CCAATCCATT 780 
TGAGTTTTGA TATTGATGCA TTTGACCCTA CACTGGCTCC AGCCACAGGA ACTCCTGTTG 840 
TCGGGGGACT AACCTATCGA GAAGGCATGT ATATTGCTGA GGAAATACAC AATACAGGGT 900 
TGCTATCAGC ACTGGATCTT GTTGAAGTCA ATCCTCAGTT GGCCACCTCA GAGGAAGAGG 960 
CGAAGACTAC AGCTAACCTG GCAGTAGATG TGATTGCTTC AAGCTTTGGT CAGACAAGAG 1020 
AAGGAGGGCA TATTGTCTAT GACCAACTTC CTACTCCCAG TTCACCAGAT GAATCAGAAA 1080 
ATCAAGCACG TGTGAGAATT TAGGAGACAC TGTGCACTGA CATGTTTCAC AACAGGCATT 1 140 
CCAGAATTAT GAGGCATTGA GGGGATAGAT GAATACTAAA TGGTTGTCTG GGTCAATACT 1200 
GCCTTAATGA GAACATTTAC ACATTCTCAC AATTGTAAAG TTTCCCCTCT ATTTTGGTCA 1260 
CCAATACTAC TGTAAATGTA nTGGTTTTT TGCAGTTCAC AGGGTATTAA TATGCTACAG 1320 
TACTATGTAA ATTTAAAGAA GTCATAAACA GCATTTATTA CCTTGGTATA TCATACTGGT 1380 
CTTGTTGCTG TTGTTCCTTC ACATTTAAGT GGTmTCAT CTTTCCTCCC TCCTCCCACA 1440 
GCCTGGCTAT ACAGTGCATC CTTGAACTGT CAGCCCACAG CAGCAATATG CTTATTCTAT 1500 
CCACATCCCT AACATCATGC ATTCACAAGG TCAAAGTTCT GGTCCACAAA CCCITCCCTA 1560 
TAGAAGTTCA ATGGCTGCGA AAGAATTTGT AGTAAACCAG GCCTCCCAGG ATGGCGAGCT 1620 
CCAGTAAGAT GATAATGGAA AGCAGCAGCT TGTTGGTTGT CACTCTACAA AGAGAAGCAA 1680 
AGTGGGGAGT AGTCAGAAGT TTGGATAACC TTCCTTCTAA ACATTTGGGG GTTAGACCTG 1740 
GGACCACGGC TGGATACTCT GAGGCTGTAT GTTTGATCAC ACAGCCACTT AGCAGGAAGT 1 800 
ACTCATAAGG TTCTTTAGCT GTCACTTAGG GATAACACTG TCTACCTCAC AGAAATGTTA 1860 
AACTGAGACA ATAAAACCCA AAGCAT 



Protein Accession*: 

1 11 21 31 41 51 
I I I I I I 

MSLRGSLSRL LQTRVHSILK KSVHSVAVIG APFSQGQKRK GVEHGPAAIR EAGLMKRLSS 60 
LGCHLKDFGD LSFTPVPKDD LYNNLIVNPR SVGLANQELA EVVSRAVSDG YSCVTLGGDH 120 
SLA1GTISGH ARHCPDLCVV WVDAHADINT PLTTSSGNLH GQPVSFLLRE LQDKVPQLPG 180 
FSWIKPC1SS ASIVYIGLRD VDPPEHFILK NYDIQYFSMR DIDRLGIQKV MERTFDLLIG 240 
KRQRPIHLSF DIDAFDPTLA PATGTPVVGG LTYREGMYIA EEIHNTGLLS ALDLVEVNPQ 300 
LATSEEEAKT TANLAVDVIA SSFGQTREGG HIVYDQLPTP SSPDESENQA RVRI 



SEQIDN0:151 PFG1 DNA SEQUENCE 
Nucleic Acid Accession*: NM_017906 ^ t ^ 

1 11 21 31 41 51 
I I I I I I 

AATTATATAT TTTTACTCTA TGTTTCTCTA CATGl'lTITl' TCTTTCCGTT GCTGGCGGAA 60 
GAGGCACGTG CGCTGCTGAAJSGAGCTGGT CGCTGGTTGC TACGAGCAGG TCCTCTTTGG 120 
GTTCGCTGTA CACCCGGAGC CCAAGGCTTG CGGCGACCAC GAGCAATGGA CTCTTGTGGC 180 
TG ACTTCACT CACCATGCTC ACACTGCCTC CTTGTCAGCA GTAGCTGTAA ATAGTCGTTT 240 
TGTGGTCACT GGGAGCAAAG ATGAAACAAT TC ACATTTAT GAC ATGAAAA AG A AGATTG A 300 
GCATGGGGCT CTAGTGCATC ACAGTGGTAC AATAACTTGC CTGAAATTCT ATGGCAACAG 360 
GCATTTAATC AGTGGAGCGG AAGATGGACT CATCTGTATC TGGGATGCAA AGAAATGGGA 420 
ATGCCTGAAG TCAATTAAAG CTCACAAAGG ACAGGTGACC TTCCTTTCTA TTCACCCATC 480 
TGGCAAGTTG GCCCTGTCGG TTGGTACAGA TAAAACTCTA AGAACGTGGA ATCTTGTAG A S40 
AGGAAGATCA GCATTCATAA AAAATATAAA ACAAAATGCT CACATAGTAG AATGGTCCCC 600 
AAGAGGAGAG CAGTATGTAG TTATCATACA GAATAAAATA GACATCTATC AGCTTGACAC 660 
TGCATCCATT AGTGGCACCA TCACAAATGA AAAGAGAATT TCCTCTGTTA AATTTCTTTC 720 
AGAGTCTGTC CTTGCAGTGG CTGGAGATGA AGAAGTTATA AGGTTTTTTG ACTGTGATTC 780 
ACTAGTGTGC CTCTGCG AAT TTAAAGCTCA TGAAAACAGG GTAAAGGACA TGTTCAGTTT 840 
TGAAATTCCA GAGCATCATG TTATTGTTTC AGCATCGAGT GATGGTTTCA TCAAAATGTG 900 
GAAGCTTAAG CAGGATAAGA AAGTTCCCCC ATCTTTACTC TGTGAAATAA ACACTAATGC 960 
CAGGCTGACG TGTCTTGGAG TGTGGCTAGA CAAAGTGGCA GACATGAAAA GCCTTCCTCC 1020 
AGCTGCAGAG CCTTCTCCTG TAAGTAAAGA ACAGTCCAAA ATTGGCAAAA AGGAGCCTGG 1080 
TGACACAGTG CACAAAGAAG AAAAGCGGTC AAAACCTAAC ACAAAGAAAC GCGGTTTAAC 1 140 
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AGGTG AC AGT AAGAAAGCAA CAAAAGAAAG TGGCCTGATA TCAACCAAGA AGAGGAAAAT 1 200 
GGTAGAAATG TTGGAAAAGA AGAGGAAAAA GAAGAAAATA AAAACAATGC AGTGAATCAC 1260 
AGATGTCTCC TGAAAGAACT CTTTTAGATG AAATCATTCT ACTCAAATGT ACCTTAATTT 1320 
_ 1 1 1T1T1 ICC CTGAGTAAAA GCAAGAAATT TCTTCCTTTG GAAAAAATAT ATATATTAAA 1380 
5 AAACCACTTT TAG ATGGTTT TTTTTAA A A A AAAA A A A A AA ACTGGTA AAA TTACTTTTGG 1440 
CAGACAGTGT TTTATGAATT ATGTATCATG TTGATATATA ATATGTTAAT GTGTCATGTA 1500 
A TTTTT ACTT TGTACAAAGC AAATAAAGAT CTTTCTCAAA AAAAAAAAAA AAAA 



10 

?eqipni?;i??pf9i EEatejagsausass: 

Protein Accession!: NP.060376.1 
, 1 11 21 31 41 51 

15 | | | | | | 

MELVAGCYEQ VLFGFAVHPE PKACGDHEQW TLVADFTHHA HTASLSAVAV NSRPVVTGSK 60 
DET1HIYDMK KKIEHGALVH HSGTITCLKF YGNRHLISGA EDGLICIWDA KKWECLKSIK 120 
AHKGQVTFLS IHPSGKLALS VGTDKTLRTW NI.VEGRSAH KNIKQNAHIV EWSPRGEQYV 180 
VIIQNKID1Y QLDTASISGT ITNEKRISSV KFLSESVLAV AGDEEVIRFF DCDSLVCLCE 240 
20 FKAHENRVKD MFSFEIPEHH VIVSASSDGF IKMWKLKQDK KVPPSLLCEI NTNARLTCLG 300 

VWLDKVADMK SLPPAAEPSP VSKEQSKIGK KEPGDTVHKE EKRSKPNTKK RGLTGDSKKA 360 
TKESGLISTK KRKMVEMLEK KRKKKKIKTM Q 



SEQ ID N0:153 PFD5 DNA SEQUENCE 

Nucleic Acid Accession*: NM_014668 

Coding sequence: 110-2953 (underlined sequences correspond to start and slop codons) 

30 1 11 21 31 41 51 

gatg'tcttgg ACATGCTCTG GCTGGCTAAT CTCCATGTTC TAGCCGACTG AAAATACGGT 60 
GGCCAAGTGG ATGGTGTGCT TATTTGCAGT CTAAAGAAAT TTCCTTTTGAJTGTGGCAGAA 120 
AATCGAGGAT GTGGAGTGGA GACCCCAGAC TTACTTGGAG CTGGAGGGTC TGCCTTGCAT 180 

35 CCTGATCTTC AGTGGGATGG ACCCGCATGG GGAGTCCTTG CCGAGGTCTT TGAGGTACTG 240 
TGACCTGCGA TTGATAAACT CCTCCTGCTT GGTGAGAACA GCCTTGGAGC AGGAGCTGGG 300 
CCTGGCTGCC TACTTTGTGA GCAACGAGGT TCCCTTGGAG AAGGGGGCTA GGAACGAGGC 360 
CTTGGAGAGT GATGCTGAGA AGCTGAGCAG CACAGACAAC GAGGATGAGG AGCTGGGGAC 420 
AGAAGGCTCT ACCTCGGAGA AGAGAAGCCC CATGAAAAGG GAGAGGTCCC GCTCCCACGA 480 

40 CTCAGCATCC TCATCCCTCT CCTCCAAGGC TTCCGGTTCA GCGCTCGGTG GCGAGTCCTC 540 
GGCTCAGCCC ACAGCACTCC CCCAGGGAGA GCATGCCAGG TCGCCCCAGC CCCGTGGCCC 600 
CGCAGAGGAG GGCAGAGCCC CTGGTGAGAA ACAGAGGCCC CGGGCAAGTC AGGGGCCACC 660 
CTCGGCCATC AGCAGGCACA GTCCCGGGCC GACGCCCCAG CCCGACTGTA GCCTCAGGAC 720 
CGGCCAGAGG AGCGTCC AGG TGTCGGTCAC CTCGTCGTGC TCCCAGCTGT CCTCCTCCTC 780 

45 GGGCTCATCC TCCTCATCCG TGGCGCCCGC TGCCGGCACG TGGGTCCTGC AGGCCTCCCA 840 
GTGCTCCTTG ACCAAGGCCT GCCGCCAGCC ACCCATTGTC TTCTTGCCCA AGCTCGTGTA 900 
CGACATGGTT GTGTCCACTG ACAGCAGTGG CCTGCCCAAG GCCGCCTCCC TCCTGCCCTC 960 
CCCCTCGGTC ATGTGGGCCA GCTCTTTCCG CCCCCTGCTC AGCAAGACCA TGACATCCAC 1020 
CGAGCAGTCC CTCTACTACC GGCAGTGGAC GGTGCCCCGG CCCAGCCACA TGGACTACGG 1080 

50 CAACCGGGCC GAGGGCCGCG TGGACGGCTT CCACCCCCGC AGGCTGCTGC TCAGCGGCCC 1140 
CCCTCAGATC GGGAAGACAG GTGCCTACCT GCAGTTCCTC AGTGTCCTGT CCAGGATGCT 1200 
TGTTCGGCTC ACAGAAGTGG ATGTCTATGA CGAGGAGGAG ATCAATATCA ACCTCAGAGA 1260 
AGAATCTGAC TGGCATTATC TCCAGCTTAG CGACCCCTGG CCAGACCTGG AGCTGTTCAA 1320 
GAAGTTGCCC TTTGACTACA TCATTCACGA CCCGAAGTAT GAAGATGCCA GCCTGATTTG 1380 

55 TTCGCACTAT CAGGGTATAA AGAGTGAAGA CAGAGGGATG TCCCGGAAGC CGGAGGACCT 1440 
TTATGTGCGG CGTCAGACGG CACGGATGAG ACTGTCCAAG TACGCAGCGT ACAACACTTA 1500 
CCACCACTGT GAGCAGTGCC ACCAGTACAT GGGCTTOCAC CCCCGCTACC AGCTGTATGA 1560 
GTCCACCCTG CACGCCTTTG CCTTCTCTTA CTCCATGCTA GGAGAGGAGA TCCAGCTGCA 1620 

, CTTCATCATCCCC^AGTCCAAGGACK;ACCACTTTGTCTTCAGCCAACCTGGAGGCCAGCT 1680 

60 GGAGAGCATG CGACTACCCC TCGTGACAGA CAAGAGCCAT GAATATATAA AAAGTCCGAC 1740 
ATTCACTCCA ACCACCGGCC GTCACGAACA TGGGCTCTTT AATCTGTACC ACGCXATGGA 1800 
CGGTGCCAGC CATTTGCACG TGCTGGTTGT CAAGGAATAC GAGATGGCAA TTTATAAGAA 1860 
ATATTGGCCC A ACCACATCA TGCTGGTGCT CCCCAGTATC TTCAACAGTG CTGGAGTTGG 1920 
TGCTGCTCAT TTCCTCATCA AGGAGCTGTC CTACCATAAC CTGGAGCTCG AGCGGAACCG 1980 

65 GCAGGAGGAG CTGGGAATCA AGCCGCAGGA CATCTGGCCT TTCATTGTGA TCTCTGATGA 2040 
CTCCTGCGTG ATGTGGAACG TGGTGGATGT CAACTCTGCT GGGGAGAGAA GCAGGGAGTT 2100 
CTCCTGGTCG GAAAGGAACG TGTCTTTGAA GCACATCATG CAGCACATCG AGGCGGCCCC 2160 
CGACATCATG CACTACGCCC TGCTGGGCCT GCGGAAGTGG TCCAGCAAGA CCCGGGCCAG 2220 

„ CGAGGTGCAA GAGCCCTTCT CCCGCTGCCA CGTGCACAAC TTCATCATCC TGAACGTGGA 2280 

70 CCTGACCCAG AACGTGCAGT ACAACCAGAA CCGGTTCCTG TGTGACGATG TAGACTTCAA 2340 
CCTGCGGGTG CACAGCGCCG GCCTCCTGCT CTGCCGGTTC AACCGCTTCA GCGTGATGAA 2400 
GAAGCAGATC GTGGTGGGCG GCCACAGGTC CTTCCACATC ACATCCAAGG TGTCTGATAA 2460 
CTCTGCCGCG GTCGTGCCGG CCCAGTACAT CTGTGCCCCG GACAGCAAGC ACACGTTCCT 2520 
CGCAGCGCCC GCCCAGCTCC TGCTGGAGAA GTTCCTGCAG CACCACAGCC ACCTCTTCTT 2580 

75 CCCGCTGTCC CTGAAGAACC ATGACCACCC AGTGCTGTCT GTCGACTGTT ACCTGAACCT 2640 
GGGATCTCAG ATTTCTGTTT GCTATGTGAG CTCCAGGCCC CACTCTTTAA ACATCAGCTG 2700 
CTCGGACTTG CTGTTCAGTG GGCTGCTGCT GTACCTCTGT GACTCTTTTG TGGGAGCTAG 2760 
CTTTTTGAAA AAGTTTCATT TTCTGAAAGG TGCGACGTTG TGTGTCATCT GTCAGGACCG 2820 
GAGCTCACTG CGCCAGACGG TCGTCCGCCT GGAGCTCGAG GACGAGTGGC AGTTCCGGCT 2880 
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GCGCGATGAG TTCCAGACCG CCAATGCCAG GGAAGACCGG CCGCTCTTTT TTCTGACGGG 2940 
ACGACACATC TGAGGAAGAC AGCGGCGAGT TTTCTGA AGA GATGAGTGCT CAGAGCCCTC 3000 
ATGCTGTTGA GGCTAAAGGG AGGCCTGGAA CGGTGGGGCG TTTGACTGGA ATGGACCCCA 3060 
GGGACTGTCC AGGTGCAGCC CCTCCTAGTA CACATGGGCC CCCX3AGGCCG TGGTCCTGGG 3120 
AGCCAGOAAG ACTCCGCAGT GGGTGAGAAT GAAAACTTGA GACTCCCAAG TTCTGGGCCA 3180 
GCCCATTGCT CTGGGCTGTT TTAAAGCCCA TTTCACGAGG AACAAAGATT TACTTCCTGT 3240 
CCTGCCATTC. GTGTGCTTCC ATGGAC AAAC CTG ATTTTTT TCTCTTAGTT CTAAAGAATC 3300 
TTGGGTTATT TTGTAGCGGT GCCAGTATTT CAGTAGATGG GATTTCAGCC AAGTAGOTTC 3360 
CCCTGTAACC TCCTACAAAG CAATATTCCA AAGGAACATT TTAACTGTAA AGGCTGGAGA 3420 
CAAGAAAAAA TAAGTAGATC GTTTTAATAA CAATTATTTA ATTGCCTATA AGTTTGCTGT 3480 
TTCAGAGGCT AGCCCAAAGG CATCAAATTT AATAAAGTTA AACAAATTGA TTTACTTCAG 3540 
AGCAAATATG ATCCTATTAA AATAATATAG GGTAAATACC CTACCTCTTA GAAAGGGCAA 3600 
AAATGCAAAG AAGCTTTCTT TAAAACTAAA AGGGTTTTTT GGGGGGGGAG TTGGCGGGGA 3660 
GGAAATAAGG CTAACAGAGG TTGACCTAAA ATTAGCCTTA CAAAGGAGAA AGGACCACAT 3720 
TGCTTACTTG AAACAGACAA TGAAAACA AC CAAAGTGATA TATAAAATAG TTGATGAGAA 3780 
CTAGACTTAT GACTGTAGTT TACTAGAGTT TAGTTTTCAG TTGCTGAAGT AGCTCATTTT 3840 
CTCTTACTAA TGTTTGGTTC CTCAGGGAAG AATCTCACTT GACTAGAGAG GAGGTGGGAA 3900 
CAGAAGAGAG AAGGAGGCAG GGAGATGTAT TTCTTAGGGC TCACCCCTTC ACAGACTGAC 3960 
AGAATGGTTT TGTTITG1TT TGTTTTGTTT TGTTTTGTTT TTGAGATGGA CTCTAGCTCT 4020 
GTCACCCAGG CTGGAGTGCA GTGGTGCG AT CTCGGCTCAC TGCAAGCTCC GCCTCCCGGG 4080 
TTCTCACCAT TCTCCTGCCT CAGCCTCCCG AGTAGCTGGG ACTACAGGCG CCCACCACCA 4140 
CGCCCGGCTA ATTTTTTGTA TTTTTTAGTA GAGACGGGGT TTCACCATGT TAGCCAGGAT 4200 
GGTCTCGATC TCCTGACCTC GTGATCCGCC CGCCTCGGCC TCCCAAAGTG CTGGGATTAC 4260 
AGGCGTGAGC CACCGTGCCT GCCCCAGAAT GGTTTTTAAA GCCACAGTTG AGAGGCCACC 4320 
CATTGCCCGG CGCCTGGACA GTGATCATCT TGTTCATCTT GTTCAGTCCT TTCTTGTGTG 4380 
ATTGGAATTA TTCATCCCCT TTGAAAGATG AGAAGGTTGA GATGCAAAGA GTCTACCTTT 4440 
CCAAGTTCTC ACTGCTGGAA AGAGCTAGAA GCACAGTTCA AAGTTCTGGC TTCTGGACTC 4500 
TGCAGTCCAG GTCTCCCTTC TCCCACTTGC CTACCCTCA A TGCCACACTG TTTTTGAAGT 4560 
GGCCCATAAC TTGAAGGAAA AGTTTAAAGA CAGTTCAATT TAATCATCAG AATGCATTCT 4620 
TTTTTTTTTC GGAGACGGAG TTTCACTCTT GCTGCCCAGG CTGGAGTGCA ATGGTGCAAT 4680 
GATCTCGGCT CACTGCAACC TCTGCCTCCT GGGTTCAAGT GATTCTCCAG CCTCAGCCTC 4740 
CCGAGTAGCT GGGATTATGG GCGCCCACCA CCATGCCCAG CTAATTTTTG TATTTTTTTT 4800 
TTTTAGTAGA GATGGGGTTT CGCCAGGTTG GCCAGGCTGG TCTTGTGAAC TCCTGGCCTC 4860 
AGGTGATCTG CCCACCTCAT CCTCCAAAAG TGCTGGGATT ACAGGCATGA GCCACTGCGC 4920 
CTGGCCTCAG AATGCATTCT TACACATCTA TCCTAGACAT TTATAAGCAC TCTAATGGAT 4980 
AACAATCCAA GAATAAATGA TTGTAAAAGA TGATGCCGAA GAGTTGATGT CAATCTTTTT 5040 
TTCCTAAGAA AAAAAGTCCG CGAGTATTAA ATATTTAGAT CAATGTTTAT AAAATGATTA 5100 
CTTTGTATAT CTCATTATTC CTA1TITGGA ATAAAAACTG ACCTTCT1TA Al'CATATACT 5160 
TGTCTTTTGT AAATAGCAGC TTTTGTGTCA TTCTCCCCAC TTTATTAGTT AATTTAAATT 5220 
GGAAAAAACC CTCAAACTAA TATTCTTGTC TGTTCCAGTC TTATAAATAA A ACTTATAAT 5280 
GCATG 



1 11 21 31 « 51 
I I I I I I 

MWQKffiDVEW RPQTYLELEG LPCILIFSGM DPHGESLPRS LRYCDLRLIN SSCLVRTALE 60 
QELGLAAYFV SNEVPLEKGA RNEALESDAE KLSSTDNEDE ELGTEGSTSE KRSPMKRERS 120 
RSHDS ASSSL SSKASGS ALG GESSAQPTAL PQGEHARSPQ PRGPAEEGRA PGEKQRPRAS 180 
QGPPSAISRH SPGPTPQPDC SLRTGQRSVQ VSVTSSCSQL SSSSGSSSSS VAPAAGTW VL 240 
QASQCSLTKA CRQPPIVFLP KLVYDMVVST DSSGLPKAAS LLPSPSVMWA SSFRPLLSKT 300 
MTSTEQSLYY RQWTVPRPSH MDYGNRAEGR VDGFHPRRLL LSGPPQIGKT GAYLQFLSVL 360 
SRMLVRLTEV DVYDEEEIN1 NLREESDWHY LQLSDPWPDL ELFKKLPFDY IIHDPKYEDA 420 
SLICSHYQGI KSEDRGMSRK PEDLYVRRQT ARMRLSKYAA YNTYHHCEOC HQYMGFHPRY 480 
QLYESTLHAF AFSYSMLGEE IQLHFIIPKS KEHHFVFSQP GGQLESMRLP LVTDKSHEYI 540 
KSPTFTPTTG RHEHGLFNLY HAMDGASHLH VLV VKEYEMA IYKKYWPNHI MLVLPSIFNS 600 
AGVGAAHFL1 KELS YHNLEL ERNRQEELGI KPQDIWPHV ISDDSCVMWN WDVNS ACER 660 
SREFSWSERN VSLKHIMQHI EAAPDIMHYA LLGLRKWSSK TRASEVQEPF SRCHVHNFII 720 
LNVDLTQNVQ YNQNRFLCDD VDFNLRVHSA GLLLCRFNRF SVMKKQJVVG GHRSFHTTSK 780 
VSDNSAAWP AQYICAPDSK HTFLAAPAQL LLEKFLQHHS HLFFPLSLKN HDHPVLSVDC 840 
YLNLGSQIS V CYVSSRPHSL NISCSDLLFS GLLLYLCDSF VGASFLKKFH FLKGATLCVI 900 
CQDRSSLRQT VVRLELEDEW QFRLRDEFQT ANAREDRPLF FLTGRHI 



SEQ ID NO:155 PFC6 DNA SEQUENCE 

Nucleic Acid Accession*: NM_000522 

Coding sequence: 1-1167 (underiined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

ATG ACAGCCT CCGTGCTCCT CCACCCCCGC TGGATCGAGC CCACCGTCAT GTTTCTCTAC 60 
GACAACOGCG GCGGCCTGGT GGCCGACGAG CTCAACAAGA ACATGGAAGG GGCGGCGGCG 120 
GCTGCAGCAG CGGCTGCAGC GGCGGCGGCT GCCGGGGCCG GGGGCGGGGG CTTCCCCCAC 180 
CCGGCGGCTG CGGCGGCAGG GGGCAACTTC TCGGTGGCGG CCGCGGCCGC GGCTGCGGCG 240 
GCCGCCGCGG CCAACCAGTG CCGCAACCTG ATGGCGCACC CGGCGCCCTT GGCGCCAGGA 300 
GCCGCGTCCG CCTACAGCAG CGCCCCCGGG GAGGCGCCCC CGTCGGCTGC CGCCGCTGCT 360 
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GCCGCGGCTG CCGCTGC AGC CGCCGCCGCC GCCGCCGCGT CGTCCTCGGG AGGTCCCGGC 420 
CCGGCGGGCC CGGCGGCGGC AGAGGCGGCC AAGCAATGCA GCCCCTGCTC GGCAGCGGCG 480 
CAGAGCTCGT CGGGGCCCGC GGCGCTGCCC TATGGCTACT TCGGCAGCGG CTACTACCCG 540 
TGCGCCCGCA TGGGCCCGCC CCCCAACGCC ATCAAGTCGT GCCCCCAGCC CCCCTCGGCC 600 
GCCGCCGCCG CCGCCTTCGC GGACAAGTAC ATGGATACCG CCGGCCCAGC TGCCGAGGAG 660 
TTCAGCTCCC GCGCTAAGGA GTTCGCGTTC TACCACCAGG GCTACGCAGC CGGGCCTTAC 720 
CACCACCATC AGCCCATGCC TGGCTACCTG GATATGCCAG TGGTGCCGGG CCTCGGGGGC 780 
CCCGGCGAGT CGCGCCACGA ACCCTTGGGT CTTCCCATGG AAAGCTACCA GCCCTGGGCG 840 
CTGCCCAACG GCTGGAACGG CCAAATGTAC TGCCCCAAAG AGCAGGCGCA GCCTCCCCAC 900 
CTCTGGAAGT CCACTCTGCC CGACGTGGTC TCCCATCCCT CGGATGCCAG CTCCTATAGG 960 
AGGGGGAGAA AGAAGCGCGT GCCTTATACC AAGGTGCAAT TAAAAGAACT TGAACGGGAA 1020 
TACGCCACGA ATAAATTCAT TACTAAGGAC AAACGGAGGC GGATATCAGC CACGACGAAT 1080 
CTCTCTGAGC GGCAGGTCAC AATCTGGTTC CAGAACAGGA GGGTTAAAGA GAAAAAAGTC 1 140 
ATCAACAAAC TGAAAACCAC TAGTTAA 

SEQ ID HO:156 PFC6 Protein sequence: 



1 11 21 31 41 51 
i I I I I I 

MTASVLLHPR WIEPTVMFLY DNGGGLVADE LNKNMEGAAA AAAAAAAAAA AGAGGGGFPH 60 
PAAAAAGGNF SVAAAAAAAA AAAANQCRNL MAHPAPLAPG AASAYSS APG EAPPSAAAAA 120 
AAAAAAAAAA AAASSSGGPG PAGPAAAEAA KQCSPCSAAA QSSSGPAALP YGYFGSGYYP 180 
CARMGPPPNA IKSCPQPPSA AAAAAFADKY MDTAGPAAEE FSSRAKEFAF YHQGYAAGPY 240 
HHHQPMPG YL DMPVVPGLGG PGESRHEPLG LPMESYQPWA LPNGWNGQMY CPKEQAQPPH 300 
LWKSTLPDW SHPSDASSYR RGRKKRVPYT KVQLKBLERE YATNKFITKD KRRRISATTN 360 
LSERQVTIWF QNRRVKEKKV INKLKT1S 



SEQ ID N0:157 PFA3 DNA SEQUENCE 

Nucleic Acid Accession*: AW102723 

Coding sequence: 523-2676 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CCCTTATGGC GATTGGGCGG CTGCAGAGAC CAGGACTCAG TTCCCCTGCC CTAGTCTGAG 60 
CCTAGTGGGT GGGACTCAGC TCAGAGTCAG TTTTCAGAAG CAGGTTTCAG TTGCAGAGTT 120 
TTCCTACACT TTTCCTGCGC TAGAGCAGCG AGCAGCCTGG AACAGACCCA GGCGGAGGAC 180 
ACCTGTGGGG GAGGGAGCGC CTGGAGGAGC TTAGAGACCC CAGCCGGGCG TGATCTCACC 240 
ATGTGCGGAT TTGCGAGGCG CGCCCTGGAG CTGCTAGAGA TCCGGAAGCA CAGCCCCGAG 300 
GTGTGCGAAG CCACCAAG AC TGCGGCTCTT GGAGAAAGCG TGAGCAGGGG GCCACCGCGG 360 
TCTCCGGCCT GTCTGCACCC TGTCGCCTGA GCTGCCTGAC AGTGACAATG ACATCCCAGT 420 
TACCAGTGTC CTTGAATTGA TAGTGGCTTC TGTTTGTCAG TCTCATATAA GAACTACAGC 480 
TCATCAGGAG GAGATCGCAG CAGGGTAAGA GACACCAACA CC&ffiTTCTG CACGAAGCTC 540 
AAGGATCTCA AGATCACAGG AG AGTGTCCT TTCTCCTTAC TGGCACCAGG TC AAGTTCCT 600 
AACGAGTCTT CAGAGGAGGC AGCAGGAAGC TCAGAGAGCT GCAAAGCAAC CGTGCCCATC 660 
TGTCAAGACA TTCCTGAGAA GAACATACAA GAAAGTCTTC CTCAAAGAAA AACCAGTCGG 720 
AGCCGAGTCT ATCTTCACAC TTTGGCAGAG AGTATTTGCA AACTGATTTT CCCAGAGTTT 780 
GAACGGCTGA ATGTTGCACT TCAGAGAACA TTGGCAAAGC ACAAAATAAA AGAAAGCAGG 840 
AAATCTTTGG AAAGAGAAGA CTTTGAAAAA ACAATTGCAG AGCAAGCAGT GCAGCAGAGT 900 
CCAGTGGAGT TATCAAAGAA TCTCTTGGTG AAGAGGTTIT TAAAATATGT TACGAGGAAG 960 
ATGAAAACAT CCTTGGGGTG GTTGGAGGCA CCCTTAAAGA TTTTTAAACA GCTTCAGTAC 1020 
CCTTCTGAAA CAGAGCAGCC ATTGCCAAGA AGCAGGAAAA AGGGGCAGCT TGAGGACGCC 1080 
TCCATTCTAT GCCTGG ATAA GGAGGATGAT TTTCTACATG TTTACTACTT CTTCCCTAAG 1140 
AGAACCACCT CCCTGATTCT TCCCGGCATC ATAAAGGCAG CTGCTCACGT ATTATATGAA 1200 
ACGGAAGTGG AAGTGTCGTT AATGCCTCCC TGCTTCCATA ATGATTGCAG CGAGTTTGTG 1260 
AATCAGCCCT ACTTGTTGTA CTCCGTTCAC ATGAAAAGCA CCAAGCCATC CCTGTCCCCC 1320 
AGCAAACCCC AGTCCTCGCT GGTGATTCCC ACATCGCTAT TCTGCAAGAC ATTTCCATTC 1380 
CATTTCATGT TTGACAAAGA TATGACAATT CTGCAATTTG GCAATGGCAT CAGAAGGCTG 1440 
ATGAACAGGA GAGACTTTCA AGGAAAGCCT AATTTTGAAT ACTTTGAAAT TCTGACTCCA 1500 
AAAATCAACC AGACCTTTAG CX5GGATCATG ACTATGTTGA ATATGCAGTT TGTTGTACGA 1560 
GTGAGGAGAT GGGACAACTC TGTGAAGAAA TCTTCAAGGG TTATGGACCT CAAAGGCCAA 1620 
ATGATCTACA TTGTTGAATC CAGTGCAATC TTGTTTTTGG GGTCACCCTG TGTGGACAGA 1680 
TTAGAAGATT TTACAGGACG AGGGCTCTAC CTCTCAGACA TCCCAATTCA CAATGCACTG 1740 
AGGGATGTGG TCTTAATAGO GGAACAAGCC CGAGCTCAAG ATGGCCTGAA GAAGAGGCTG 1800 
GGG AAGCTG A AGGCTACCCT TGAGCAAGCC CACCAAGCCC TGGAGGAGGA GAAG AAAAAG 1860 
ACAGTAGACC TTCTGTGCTC CATATTTCCC TGTGAGGTTG CTCAGCAGCT GTGGCAAGGG 1920 
CAAGTTGTGC AAGCCA AGAA GTTCAGTAAT GTCACCATGC TCTTCTCAGA CATCGTTGGG 1980 
TTCACTCCCA TCTGCTCCCA GTGCTCACCG CTGCAGGTCA TCACCATGCT CAATGCACTG 2040 
TACACTCGCT TCGACGAGCA GTGTGGAGAG CTGGATGTCT ACAAGGTGGA GACCATTGCG 2100 
ATGCCTATTG TGTGGCTTGG GGGATTACAC AAAGAGAGTG ATACTCATGC TGTTCAG ATA 2160 
GCGCTGATGG CCCTGAAGAT G ATGGAGCTC TCTGATGAAG TTATGTCTCC CCATGGAG AA 2220 
CCTATCAAGATGCGAATTGG ACTGC ACTCT GGATC AGTTT TTGCTGGCGT CGTTGGAGTT 2280 
AAAATGCCCC GTTACTGTCT TTTTGGAA AC AATGTCACTC TGGCTAACAA ATTTGAGTCC 2340 
TGCAGTGTAC CACGAAAAAT CAATGTCAGC CCA ACAACTT ACAGATTACT CAAAGACTGT 2400 
CCTGGTTTCG TGTTTACCCC TCGATCAAGG GAGGAACTTC CACCAAACTT CCCTAGTGAA 2460 
ATCCCCGGAA TCTGCCATTT TCTGGATGCT TACCAACAAG GAACAAACTC AAAACCATGC 2520 
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TTCCAAAAGA AAGATGTGGA AGATGCAAGC CAAnTTTTA GGCAAAGCAT CAGGAATAGA 2580 
TTAGCAACCT ATATACCTAT TTATAAGTCT TTGGGGTTTG ACTCATTGAA GATGTGTAGA 2640 
GCCTCTGAAA GCACTTTAGG GATTGTAGAT GGCJAACAAG CAGTATTAAA ATTTCAGGAG 2700 
CCAAGTCACA ATCTTTCTCC TGTTTAACAT GACAAAATGT ACTCACTTCA GTACTTCAGC 2760 
TCTTCAAGAA AAAAAAAAAA ACCTTA AAAA GCTACTTTTG TGGGAGTATT TCTATTATAT 2820 
AACCAGCACT TACTACCTGT ACTCAAAATT CAGCACCTTG TACATATATC AGATAATTGT 2880 
AGTCAATTGT ACAAACTGAT GGAGTCACCT GCAATCTCAT ATCCTGGTGG AATGCCATGG 2940 
TTATTAAAGT GTGTTTGTGA TAGTTGTCGT CAAAAA AAAA AAAAAAAAAA AAAAAAAAAA 3000 
AAAA 



1 11 21 31 41 51 
I I I I I I 

MFCTKLKDLK ITGECPFSLL APGQVPNESS EEA AGS SESC KATVPICQDI PEKNIQESLP 60 
QRKTSRSRVY LHTLAESICK L1FPEFERLN VALQRTLAKH KIKESRKSLE REDFEKTIAE 120 
20 QAVQQSPVEL SKNLLVKRFL KYVTRKMKTS LGWLEAPLKI FKQLQYPSET EQPLPRSRKK 180 
GQLEDAS1LC LDKEDDFLHV YYFFPKRTTS LTLPGHKAA AHVLYETEVE VSLMPPCFHN 240 
DCSEFVNQPY LLYSVHMKST KPSLSPSKPQ SSLVIPTSLF CKTFPFHFMF DKDMTILQFG 300 
NGIRRLMNRR DFQGKPNFEY FEILTPKINQ TFSGIMTMLN MQFWRVRRW DNSVKKSSRV 360 
MDLKGQMIYI VESSAILFLG SPCVDRLEDF TGRGLYLSDI PIHNALRDVV LIGEQAR AQD 420 
25 GLKKRLGKLK ATLEQAHQAL EEEKKKTVDL LCSIFPCEVA QQLWQGQVVQ AKKFSNVTML 480 
FSDIVGFTAI CSQCSPLQVI TMLNALYTRF DQQCGELDVY KVETIAMHV WLGGLHKESD 540 
THAVQIALMA LKMMELSDEV MSPHGEPIKM RIGLHSGSVF AGVVGVKMPR YCLFGNN VTL 600 
ANKFESCSVP RKINVSPTTY RLLKDCPGFV FTPRSREELP PNFPSEIPGI CHFLDAYQQG 660 
TNSKPCFQKK DVEDASQFFR QSIRNRIATY IP1YKSLGFD SLKMCRASES TLGIVDG 



SEQ ID NO:159 PFA1 DNA SEQUENCE 

Nucleic Acid Accession* NM.004362 

Coding sequence: 1 02-1 934 (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 
I I I I I I 

CGCCGGCGGG ACTGGTCTGA AGAGACGCGG GGACAAAGTG GCAACGACTT GGACATCTGA 60 
GCTGTCACTG CCGAAAACAG GCCGCAAGAG AGATAATCAA TATGCATTTC CAAGCCTm 120 
GGCTATGTTT GGGTCTTCTG TTCATCTCAA TTAATGCAGA ATTTATGGAT GATGATGTTG 180 
AG ACGGAAG A CTTTGAAGAA AATTCAG AAG AAATTG ATGT TAATG AAAGT G AACTTTCCT 240 
CAGAGATTAA ATATAAGACA CCTCAACCTA TAGGAGAAGT ATATTTTGCA GAAACTTTTG 300 
ATAGTGGAAG GTTGGCTGGA TGGGTCTTAT CAAAAGCAAA GAAAGATGAC ATGGATGAGG 360 
AAATTTCAAT ATACGATGGA AGATGGGAAA TTGAAGAGTT GAAAGAAAAC CAGGTACCTG 420 
GTGACAGAGG ACTGGTATTA AAATCTAGAG CAAAGCATCA TGCAATATCT GCTGTATTAG 480 
CAAAACCATT CATTTTTGCT GATAAACCCT TGATAGTTCA ATATGAAGTA AATTTTCAAG 540 
ATGGTATTGA TTGTGGAGGT GCATACATTA AACTCCTAGC AGACACTGAT GATTTGATTC 600 
TGGAAAACTT TTATGATAAA ACATCCTATA TCATTATGTT TGGACCAGAT AAATGTGGAG 660 
AAGATTATAA ACTTCATTTT ATCTTCAGAC ATAAACATCC CAAAACTGGA GTTTTCGAAG 720 
AGAAACATGC CAAACCTCCA GATGTAGACC TTAAAAAGTT CTTTACAGAC AGGAAGACTC 780 
ATCTTTATAC CCTTGTGATG AATCCAGATG ACACATTTGA GGTGTTAGTT GATCAAACAG 840 
TTGTAAACAA AGGAAGCCTC CTAGAGGATG TGGTTCCTCC TATCAAACCT CCCAAAGAAA 900 
TTGAAGATCC CAATGATAAA AAACCTGAGG AATGGGATGA AAGAGCAAAA ATTCCTGATC 960 
CTTCTGCCGT CAAACCAGAA GACTGGGATG AAAGTGAACC TGCCCAAATA GAAGATTCAA 1020 
GTGTTGTTAA ACCTGCTCGC TGGCTTGATG ATGAACCAAA ATTTATCCCT GATCCTAATG 1080 
CTGAAAAACC TGATGACTGG AATGAAGACA CGGATGGAGA ATGGGAGGCA CCTCAGATTC 1140 
TTAATCCAGC ATGTCGGATT GGGTGTGGTG AGTGGAAACC TCCCATGATA GATAACCCAA 1200 
AATACAAAGG AGTATGGAGA CCTCCACTGG TCGATAATCC TAACTATCAG GGAATCTGGA 1260 
GTCCTCGAAA AATTCCTAAT CCAGATTATT TCGAAGATGA TCATCCATTT CTTCTGACTT 1 320 
CTTTCAGTGC TCTTGGTTTA GAGCTTTGGT CTATGACCTC TGATATCTAC TTTGATAATT 1380 
TTATTATCTG TTCGGAAAAG GAAGTAGCAG ATCACTGGGC TGCAGATGGT TGGAGATGGA 1440 
AAATAATGAT AGCAAATGCT AATAAGCCTG GTGTATTAAA ACAGTTAATG GCAGCTGCTG 1500 
AAGGGCACCC ATGGCITTGG TTGATTTATC TTGTGACAGC AGGAGTGCCA ATAGCATTAA 1560 
TTACTTCATT TTGTTGGCCA AGAAAAGTAA AGAAAAAACA TAAAGATACA GAGTATAAAA 1620 
AAACCGACAT ATGTATACCA CAAACAAAAG GAGTACTAGA GCAAGAAGAA AAGGAAGAGA 1680 
AAGCAGCCCT GGAAAAACCA ATGGACCTGG AAGAGGAAAA AAAGCAAAAT GATGGTGAAA 1740 
TGCTTGAAAA AGAAGAGGAA AGTGAACCTG AGGAAAAGAG TGAAGAAGAA ATTGAAATCA 1800 
TAGAAGGGCA AGAAGAAAGT AATCAATCAA ATAAGTCTGG GTCAG AGGAT GAG ATG AAAG 1860 
AAGCAGATGA GAGCACAGGA TCTGGAGATG GGCCGATAAA GTCAGTACGC AAAAGAAGAG 1920 
TACG AAAGGA CTAAACTAGA TTGAAATATT TTTAATTCCC GAGAGGATGT TTGGCATTGT 1980 
AAAAATCAGC ATGCCAGACC TGAACTTTAA TCAGTCTGCA CATCCTGTTT CTAATATCTA 2040 
GCAACATTAT ATTCTTTCAG ACATTTATTT TAGTCCTTCA TTTCCGAGGA AAAAGAAGCA 2100 
ACTTTGAAGT TACCTCATCT TTGAATTTAG AATAAAAGTG GCACATTACA TATCGGATCT 2160 
AAGAGATTAA TACCATTAGA AGTTACACAG TTTTAGTTGT TTGGAGATAG TTTTGGTTTG 2220 
TACAGAACAA AATAATATGT AGCAGCTTCA TTGCTATTGG AAAAATCAGT TATTGGAATT 2280 
TCCACTTAAA TGGCTATACA ACAATATAAC TGGTAGTTCT ATAATAAAAA TGAGCATATG 2340 
TTCTGTTGTG AAGAGCTAAA TGCAATAAAG TTTCTGTATG GTTGTTTGAT TCTATCAACA 2400 
ATTGAAAGTG TTGTATATGA CCCACATTTA CCTAGTTTGT GTCAAATTAT AGTTACAGTG 2460 
AGTTGTTTGC TTAAATTATA GATTCCTTTA AGGACATGCC TTGTTCATAA AATCACTGGA 2520 
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TTATATTGCA GCATATTTTA CATTTGAATA CAAGGATAAT GGGTTTTATC AAAACAAAAT 2580 
GATGTACAGA TTTTTTTTCA AGTTTTTATA GTTGCTTTAT GCCAGAGTGG TTTACCCCAT 2640 
TCACAAAATT TCTTATGCAT ACATTGCTAT TGAAAATAAA ATTTAAATAT TTTTTCATCC 27O0 
TGAAAAAAAA 



1U 1 11 21 31 41 51 
I I I I I I 

MHFQAFWLCL GLLFISINAE FMDDDVETED FEENSEEIDV NESELSSEIK YKTPQPIGEV 60 
YFAETFDSGR LAGWVLSKAK KDDMDEEISI YDGRWE1EEL KENQVPGDRG LVLKSRAKHH 120 
AISAVLAKPF EFADKPL1VQ YEVNFQDGID CGGAYIKLLA DTDDLILENF YDKTSYIIMF 180 

15 GPDKCGEDYK LHFIFRHKHP KTGVFEEKHA KPPDVDLKKF FTDRKTHLYT LVMNPDDTFE 240 
VLVDQTVVNK GSLLEDVVPP IKPPKEIEDP NDKKPEEWDE RAKIPDPSAV KPEDWDESEP 300 
AQEDSSVVK PAGWLDDEPK FIPDPNAEKP DDWNEDTDGE WEAPQILNPA CRIGCGEWKP 360 
PMIDNPKYKG VWRPPLVDNP NYQGIWSPRK IPNPDYFEDD HPELLTSFSA LGLELWSMTS 420 
DIYFDNFIIC SEKEVADHWA ADGWRWK1MI ANANKPGVLK QLMAAAEGHP WLWLIYLVTA 480 

20 GVPIALITSF CWPRKVKKKH KDTEYKKTDI C1PQTKGVLE QEEKEEKAAL EKPMDLEEEK 540 
KQNDGEMLEK EEESEPEEKS EEEIEIEGQ EESNQSNKSG SEDEMKEADE STGSGDGPIK 600 
SVRKRRVRKD 

25 SEQ ID N0:161 PE29 DNA SEQUENCE 

Nucleic Acid Accession #: NM_005932 

Coding sequence: 75 221 6 (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 

30 I | | | | | 

GCGGAGCGCG CGCTCCCAGC G AAAGCAGCA GGGCAGGGAT CTGCGTTGGA GGAAGGGACT 60 
GCTCTGGTGC TAGAATGCTG TGCGTCGGAA GGCTGGGCGG CTTGGGAGCC AGAGCAGCAG 120 
CTCTGCCGCC CCGCCGGGCG GGCCGGGGAA GCCTCGAAGC CGGGATCCGG GCCCGAAGGG 1 80 
TCAGCACCAG CTGGTCTCCC GTGGGCGCCG CCTTCAATGT CAAGCCCCAG GGCAGCCGCT 240 

35 TGG ACCTGTT CGGCGAGCGG GCGCGTCTTT TTGG AGTTCC TGAGCTGAGT GCCCCAGAAG 300 
GATTTCATAT TGCACAAGAA AAAGCCTTGA GAAAGACAGA ATTGCTTGTG GACCGTGCAT 360 
GITCCACCCC ACCTGGGCCC CAGACCGTCC TGATCTTCGA TGAGCTCTCG GATTCCTTAT 420 
GCAGAGTGGC CGAC1TGGCT GATTTTGTGA AAATCGCTCA CCCTGAGCCA GCATTCAGAG 480 
AAGCTGCGGA AGAAGCTTGT AGAAGTATTG GCACCATGGT AGAGAAGTTG AACACAAATG 540 

40 TGGATTTATA TCAAAGTTTG CAAAAATTAC TAGCTGATAA AAAACTTGTG GATTCCCTTG 600 
ATCCAGAAAC AAGGCGAGTG GCTGAACTGT TTATGTTTGA TTTTGAAATT AGTGGAATCC 660 
ATCTAGACAA ACAAAAGCGT AAAAGAGCAG TGGACCTCAA TGTTAAAATC TTGGATTrGA 720 
GTAGTACATT TCTTATGGGA ACCAATTTTC CCAACAAGAT TGAGAAGCAT CTCTTACCAG 780 
AACACATTCG TCGTAACTTT ACATCTGCTG GGGATCATAT CATAATTGAT GGTCTCCACG B40 

45 CAGAATCACC AGATGACTTG GTGCGAGAAG CTGCTTATAA AATTTTTCTT TATCCCAATG 900 
CTGGTCAATT GAAATGTTTA GAAGAATTGC TCAGCACCAG AGATCTTCTG GCAAAGTTGG 960 
TGGGGTATTC CACGTTTTCT CACAGGGCTC TCCAAGGAAC GATAGCTAAA AATCCAGAGA 1020 
CTGTCATGCA GTTCCTTGAA AAACTATCTG ACAAACTTTC TGAAAGAACT CTGAAAGATT 1080 

_ rt TTGAGATGAT ACGAGGGATG AAAATGAAAC TGAATGCTCA AAATTCCGAA GTAATGCCCT 1 140 

50 GGGACCCCCC TTACTACAGT GGTGTGATIC GTGCAGAAAG GTATAATATT GAGCCCAGCC 1200 
TATATTGCCC GTTTTTCTCT CTTGGAGCAT GCATGGAAGG CCTGAATATT TTGCTTAACA 1260 
GACTGTTGGG GATTTCATTA TATGCAGAGC AGCCTGCAAA AGGAGAGGTG TGG AGCGAAG 1320 
ATGTCCGAAA ACTGGCTGTT GTTCATGAAT CTGAAGGATT GTTGGGGTAC ATTTACTGTG 1380 
AnTTTTTCA GCGAGCAGAC AAACCACATC AGGATTGCCA TTTCACTATC CGTGGAGGCA 1440 

55 GACTAAAGGA AGATGGAGAC TATCAACTCC CACTTGTAGT TCTTATGCTG AATCTTCCCC 1500 
GTTCCTCAAG GAGTTCTCCA ACTTTGCTAA CTCCTGGCAT GATGGAAAAT CTTTTCCATG 1560 
AAATGGGACA TGCCATGCAT TCAATGCTAG GACGTACTCG TTACCAACAC GTCACTGGGA 1620 
CCAGGTGCCC TACTGATTTT GCTGAGGTTC CTTCTATTCT GATGGAGTAC TTTGCAAATG 1680 
ATTATCGAGT AGTTAACCAA TTTGCCAG AC ATTATCAGAC TGGACAGCCA CTGCCAAAAA 1740 

60 ATATGGTGTC TCGTCTTTGT GAATCTAAAA AGGTTTGTGC TGCAGCTGAT ATGCAACTTC 1800 
AGGTCTTTTA TGCCACTCTG GATCAAATCT ACCATGGGAA GCATCCCCTG AGGAATTCAA 1860 
CCACAGAC AT TCTCA AGGAA ACACAAGAG A A ATTCTATGG CCTACCATAT GTTCCAAATA 1920 
CTGCCTGGCA GCTGCGATTC AGCCACCTCG TGGGGTATGG TGCTAGATAT TACTCTTACC 1980 
TCATGTCCAG AGCGGTCGCC TCCATGGTTT GGAAGGAGTG TTTTCTACAG GATCCTTTCA 2040 

65 ACAGGGCTGC CGGGGAGCGC TATCGCAGGG AGATGCTGGC CCACGGTGGA GGCAGGGAGC 2100 
CCATGCTCAT GGTTGAAGGT ATGCTTCAGA AGTGTCCTTC TGTTGATGAC TTCGTAAGTG 2160 
CCCTCGTTTC CGACTTGGAT CTGGACTTCG AAACTTTCCT CATGGATTCT GAATAAAAGA 2220 
AACACTCTAC ACCTCTAATC AAGGTCATGT AGTAATGACT TTGTTATAAA TGCTACAGCT 2280 
GTGAGAGCTT GTTTCTGATT GTTTCATTGT TCGCTTCTGT AATTCTGAAA AACTTTAAAC 2340 

70 TGGTAGAACT TGGAATAAAT AATTTGTnT A ATTAAAAAA AAAAAAAAAA AA 

gEqiDNQilWPEBPnrteln^uence; 



I I I I I I 

MLCVGRLGGL GARAAALPPR RAGRGSLEAG IRARRVSTSW SPVGAAFNVK PQGSRLDLFG 60 
ERARLFGVPE LSAPEGFHIA QEKALRKTEL LVDRACSTPP GPQTVLIFDE LSDSLCRVAD 120 
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LADFVK1AHP EPAFREAAEE ACRSIGTMVE KLNTN VDLYQ SLQKLLADKK LVDSLDPETR 180 
RVAELFMFDF EISGIHLDKQ KRKRAVDLNV KILDLSSTFL MGTNFPNKIE KHLLPEHKR 240 
NFTSAGDHU IDGLHAESPD DLVREAAYKI FLYPNAGQLK CLEELLSSRD LLAKLVGYST 300 
FSHRALQGTI AKNPETVMQFLEKLSDK1SE RTLKDFEMIR GMKMKLNAQN SEVMPWDPPY 360 
5 YSGVIRAERY N1EPSLYCPF FSLGACMEGL NILLNRLLGI SLYAEQPAKG EVWSEDVRKL 420 
AVVHESEGLL GYrYCDFFQR ADKPHQDCHF TIRGGRLKED GDYQLPLVVL MLNLPRSSRS 480 
SPTLLTPGMM ENLFHEMGHA MHSMLGRTRY QHVTGTRCPT DFAEVPSILM EYFANDYRVV 540 
NQFARHYQTG QPLPKNMVSR LCESKKVCAA ADMQLQVFYA TLDQIYHGKH PLRNSTTDIL 600 
KETQEKFYGL PYVPNTAWQL RFSHLVGYGA RYYSYLMSRA VASMV WKECF LQDPFNRAAG 660 
1 0 ERYRREMLAH GGGREPMLMV EGMLQKCPSV DDFVS ALVSD LDLDFETFLM DSE 



SEQ 10 NO:163 PEZ8 DNA SEQUENCE 

Nucleic Acid Accession #: AF103907 
15 Coding sequence: none (underlined sequences correspond to start and slop colons) 



on I I I I I I 

20 ACAGAAGAAA TAGCAAGTGC CGAGAAGCTG GCATCAGAAA AACAGAGGGG AGATTTGTGT 60 
GGCTGCAGCC GAGGGAGACC AGGAAGATCT GCATGGTGGG AAGGACCTGA TGATACAGAG 120 
GAATTACAAC ACATATACTT AGTGTTTCAA TGAACACCAA GATAAATAAG TGAAGAGCTA 180 
GTCCGCTGTG AGTCTCCTCA GTGACACAGG GCTGGATCAC CATCGACGGC ACTTTCTGAG 240 
TACTCAGTGC AGCAAAGAAA GACTACAGAC ATCTCAATGG CAGGGGTGAG AAATAAGAAA 300 

25 GGCTGCTGAC TTTACCATCT GAGGCCACAC ATCTGCTGAA ATGGAGATAA TTAACATCAC 360 
TAGAAACAGC AAGATGACAA TATAATGTCT AAGTAGTGAC ATGTITTTGC ACATTTCCAG 420 
CCCCTTTAAA TATCCACACA CACAGGAAGC ACAAAAGGAA GCACAGAGAT CCCTGGGAGA 480 
AATGCCCGGC CGCCATCTTG GGTCATCGAT GAGCCTCGCC CTGTGCCTGG TCCCGCTTGT 540 
GAGGGAAGGA CATTAGAAAA TGAATTGATG TGTTCCTTAA AGGATGGGCA GGAAAACAGA 600 

30 TCCTGTTGTG GATATTTATT TGAACGGGAT TACAGATTTG AAATGAAGTC ACAAAGTGAG 660 
CATTACCAAT GAGAGGAAAA CAGACGAGAA AATCTTGATG GCTTCACAAG ACATGCAACA 720 
AACAAAATGG AATACTGTGA TGACATGAGG CAGCCAAGCT GGGGAGGAGA TAACCACGGG 780 
GCAGAGGGTC AGGATTCTGG CCCTGCTGCC TAAACTGTGC GTTCATAACC AAATCATTTC 840 
ATATTTCTAA CCCTCAAAAC AAAGCTGTTG TAATATCTGA TCTCTACGGT TCCTTCTGGG 900 

35 CCCAACATTC TCCATATATC CAGCCACACT CATTTTTAAT ATTTAGTTCC CAGATCTGTA 960 
CTGTGACCTT TCTACACTGT AGAATAACAT TACTCATTTT GTTCAAAGAC CCTTCGTGTT 1020 
GCTGCCTAAT ATGTAGCTGA CTGTTTTTCC TAAGGAGTGT TCTGGCCCAG GGGATCTGTG 1080 
AACAGGCTGG GAAGCATCTC AAG ATCTTTC CAGGGTTATA CTTACTAGCA CACAGCATGA 1 140 
TCATTACGGA GTGAATTATC TAATCAACAT CATCCTCAGT GTCTTTGCCC ATACTG AAAT 1200 

40 TCATTTCCCA CTTTTGTGCC CATTCTCAAG ACCTCAAAAT GTCATTCCAT TAATATCACA 1260 
GGATTAACTT TTTTTTTTAA CCTGGA AGAA TTCAATGTTA CATGCAGCTA TGGGAATTTA 1320 
ATTACATATT TTGTTTTCCA GTGCAAAGAT GACTAAGTCC TTTATCCCTC CCC1TTGTTT 1380 
GATTTTTTTT CCAGTATAAA GTTAAAATGC TTAGCCTTGT ACTGAGGCTG TATACAGCAC 1440 
AGCCTCTCCC CATCCCTCCA GCCTTATCTG TCATCACCAT CAACCCCTCC CATACCACCT 1500 

45 AAACAAAATC TAACTTGTAA TTCCTTGAAC ATGTCAGGAC ATACATTATT CCTTCTGCCT 1560 
GAGAAGCTCT TCCTTGTCTC TTAAATCTAG AATG ATGTAA AGTTTTGAAT AAGTTGACTA 1620 
TCTTACTTCA TGCAAAGAAG GGACACATAT GAGATTCATC ATCACATGAG ACAGCAAATA 1680 
CTAAAAGTGT AATTTGATTA TAAGAGTTTA GATAAATATA TGAAATGCAA GAGCCACAGA 1740 

.„ GGGAATGTTT ATGGGGCACG TTTGTAAGCC TGGGATGTGA AGCAAAGGCA GGGAACCTCA 1800 

50 TAGTATCTTA TATAATATAC TTCATTTCTC TATCTCTATC ACAATATCCA ACAAGCTTTT 1860 

CACAGAATTC ATGCAGTGCA AATCCCCAAA GGTAACCTTT ATCCATTTCA TGGTGAGTGC 1920 
GCTTTAGAAT TTTGGCAAAT CATACTGGTC ACTTATCTCA ACnTGAGAT GTGTTTGTCC 1980 
TTGTAGTTAA TTG AAAGAAA TAGGGCACTC TTGTGAGCCA CTTTAGGGTT CACTCCTGGC 2040 
AATAAAGAAT TTACAAAGAG CTACTCAGGA CCAGTTGTTA AGAGCTCTGT GTGTGTGTGT 2100 

55 GTGTGTGTGT GAGTGTACAT GCCAAAGTGT GCCTCTCTCT CTTGACCCAT TATTTCAGAC 2160 
TTAAAACAAG CATGTTTTCA AATGGCACTA TGAGCTGCCA ATGATGTATC ACCACCATAT 2220 
CTCATTATTC TCCAGTAAAT GTGATAATAA TGTCATCTGT TAACATAAAA AAAGTTTGAC 2280 
TTCACAAAAG CAGCTGGAAA TGGACAACCA CAATATGCAT AAATCTAACT CCTACCATCA 2340 

rn GCTACACACT GCTTGACATA TATTGTTAGA AGCACCTCGC AnTGTGGGT TCTCTTAAGC 2400 

60 AAAATACTTG CATTAGGTCT CAGCTGGGGC TGTGCATCAG GCGGTTTGAG AAATATTCAA 2460 
TTCTCAGCAG AAGCCAGAAT TTGAATTCCC TCATCTTTTA GGAATCATTT ACCAGGTTTG 2520 
GAGAGGATTC AGACAGCTCA GGTGCTTTCA CTAATGTCTC TGAACTTCTG TCCCTCTTTG 2580 
TGTTCATGGA TAGTCCAATA AATAATGTTA TCTTTGAACT GATGCTCATA GGAGAGAATA 2640 
TAAG AACTCT G AGTG ATATC AAC ATTAGGG ATTCAAAGAA ATATTAGATT TA AGCTCACA 2700 

65 CTGGTCAAAA GGAACCAAGA TACAAAG AAC TCTGAGCTGT CATCGTCCCC ATCTCTGTGA 2760 
GCCACAACCA ACAGCAGGAC CCAACGCATG TCTGAGATCC TTAAATCAAG GAAACCAGTG 2820 
TCATGAGTTG AATTCTCCTA TTATGGATGC TAGCTTCTGG CCATCTCTGG CTCTCCTCTT 2880 
GACACATATT AGCTTCTAGC CTTTGCTTCC ACGACTTTTA TCTTTTCTCC AACACATCGC 2940 
TTACCAATCC TCTCTCTGCT CTGTTGCTTT GGACTTCCCC ACAAGAATTT CAACG ACTCT 3000 

70 CAAGTCTTTT CTTCCATCCC CACCACTAAC CTGAATGCCT AGACCCTTAT TTTTATTAAT 3060 
TTCCAATAGA TGCTGCCTAT GGGCTATATT GCTTTAGATG AACATTAGAT ATTTAAAGCT 3120 
CAAGAGGTTC AAAATCCAAC TCATTATCTT CTCTTTCTTT CACCTCCCTG CTCCTCTCCC 3180 
TATATTACTG ATTGCACTGA ACAGCATGGT CCCCAATGTA GCCATGCAAA TGAGAAACCC 3240 
AGTGGCTCCT TGTGGTACAT GCATGCAAGA CTGCTGAAGC CAGAAGGATG ACTGATTACG 3300 

75 CCTCATGGGT GGAGGGGACC ACTCCTGGGC CTTCGTGATT GTCAGGAGCA AGACCTGAGA 3360 
TGCTCCCTGC CTTCAGTGTC CTCTGCATCT CCCCTTTCTA ATGAAGATCC ATAGAATTTG 3420 
CTACATTTGA G AATTCCAAT TAGG AACTCA CATGTTTTAT CTGCCCTATC AATTTTTTAA 3480 
ACTTGCTGAA AATTAAGTTT TTTCAAAATC TGTCCTTGTA AATTACTTTT TCTTACAGTG 3540 
TCTTGGCATA CTATATCAAC TTTGATTCTT TGTTACAACT TTTCTTACTC TTTTATCACC 3600 
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AAAGTGGCTT TTATTCTCTT TATTATTATT ATTTTCTTTT ACTACTATAT TACGTTGTTA 3660 
TTATTTTGTT CTCTATAGTA TCAATTTATT TGATTTAGTT TCAATTTATT TTTATTGCTG 3720 
ACTTTTAAAA TAAGTGATTC GGGGGGTGGG AGAACAGGGG AGGGAGAGCA TTAGGACAAA 3780 
TACCTAATGC ATGTGGGACT TA AAACCTAG ATGATGGGTT GATAGGTGCA GCAAACCACT 3840 
5 ATGGCACACG TATACCTGTG TAACAAACCT ACACATTCTG CACATGTATC CCAGAACGTA 3900 
AAGTAAAATT TAAAAAAAAG TGA 



Protein Accession*: none 

SEQ ID N0:164PEZ6 DNA SEQUENCE 

Nucleic Acid Accession #: AB028945 

Coding sequence: 1-3765 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

AJgATGATGA ACGTCCCCGG CGGAGGAGCG GCCGCGGTGA TGATGACGGG CTACAATAAT 60 
GGTCGCTGTC CCCGG AATTC TCTCTAC AGT GACTGCATTA TTGAGGAGAA GACGGTGGTC 1 20 
CTGCAGAAAA AAG ACAATG A GGGCTTTGGA TTCGTGCTTC GAGGGGCCAA AGCTGACACA 180 
CCCATTGAAG AATTCACACC AACACCGGCT TTCCCAGCCC TACAGTACCT GGAGTCCGTG 240 
GATGAAGGTG GGGTGGCGTG GCAAGCCGG A CTAAGGACCG GGGACTTCTT GATTGAGGTT 300 
AACAATGAGA ATGTTGTCAA AGTCGGCCAC AGGCAGGTGG TGAACATGAT CCGGCAGGGA 360 
GGGAATCACC TGGTCCTTAA GGTGGTCACG GTGACCAGGA ATCTGGACCC CGACGACACC 420 
GCCAGGAAGA AAGCTCCCCC GCCTCCAAAG CGGGCACCGA CCACAGCCCT CACCCTGCGC 480 
TCCAAGTCCA TGACCTCGGA GCTGGAGGAG CTCGTGGATA AAGATAAACC CGAGGAGATA 540 
GTCCCGGCCT CCAAGCCCTC CCGCGCTGCT GAGAACATGG CTGTGGAACC GAGGGTGGCG 600 
ACCATCAAGC AGCGGCCCAG CAGGCGGTGC TTCCCGGCGG GCTCAGACAT GAACTCTGTG 660 
TACGAACGCC AAGGAATCGC CGTGATGACG CCCACTGTTC CTGGGAGCCC AAAAGCCCCG 720 
TTTCTGGGCA TCCCTCGAGG TACGATGCGA AGGCAGAAAT C AATAG AC AG CAGAATCTTT 780 
CTATC AG G A A TAACAGAGGA AGAGCGGCAG TCTCTGGCTC CTCCAATGCT GAAGTTCACC 840 
AGAAGCCTGT CCATGCCGGA CACCTCTGAG GACATCCCCC CTCCACCGCA GTCTGTGCCC 900 
CCGTCCCCAC CACCACCTTC CCCAACCACT TACAACTGCC CCAAGTCCCC AACTCCAAGA 960 
GTCTACGGGA CGATTAAGCC TGCGTTCAAT CAGAATTCTG CCGCCAAGGT GTCCCCCGCC 1020 
ACCAGGTCCG ACACCGTGGC CACCATGATG AGGGAGAAGG GGATGTACTT CAGGAGAGAG 1080 
CTGGACCGCT ACTCCTTGGA CTCTGAAGAC CTCTACAGTC GGAATGCCGG CCCGCAAGCC 1 140 
AACTTCCGCA ACAAGAGAGG CCAGATGCCA GAAAACCCAT ACTCAGAGGT GGGGAAGATC 1200 
GCCAGCAAAG CCGTCTACGT CCCCGCCAAG CCCGCCAGGC GGAAGGGGAT GCTGGTGAAG 1260 
CAGTCCAACG TGGAGGACAG CCCCGAGAAG ACGTGCTCCA TCCCTATCCC GACCATCATC 1320 
GTGAAGGAGC CGTCCACCAG CAGCAGCGGC AAGAGCAGCC AGGGCAGCAG CATGGAGATC 1380 
GACCCCCAGG CCCCGGAGCC ACCGAGCCAG CTGCGGCCTG ACGAAAGCCT GACCGTCAGC 1440 
AGCCCCTTTG CCGCCGCCAT CGCCGGAGCC GTCCGCGACC GTGAGAAGCG GCTGGAAGCC 1500 
AGGAGGAACT CCCCGGCCTT CCTCTCCACA GACCTGGGGG ATGAGGATGT GGGCCTGGGG 1560 
CCACCCGCCC CCAGGACGCG GCCCTCCATG TTCCCCGAGG AGGGGGATTT TGCTGACGAG 1620 
GACAGCGCTG AGCAGCTGTC ATCCCCCATG CCGAGTGCCA CGCCCAGGGA GCCCGAAAAC 1680 
CATTTCGTGG GTGGCGCCGA GGCCAGTGCT CCGGGTGAGG CTGGGAGGCC GCTGAATTCC 1740 
ACGTCCAAAG CCCAGGGGCC CGAGAGCAGC CCAGCAGTGC CCTCCGCGAG CAGCGGCACA 1800 
GCCGGCCCCG GGAATTATGT CCACCCACTC ACAGGGCGGC TGCTTGATCC CAGCTCCCCG 1860 
CTGGCCCTGG CACTCTCCGC AAGGGACCGA GCCATGAAGG AGTCTCAACA GGGACCCAAA 1920 
GGGGAGGCCC CCAAGGCCGA CCTCAACAAA CCTCTTTACA TTGATACCAA AATGCGGCCC 1980 
AGCCTGGATG CCGGCTTCCC TACGGTCACC AGGCAGAACA CCCGGGGACC CCTGAGGCGG 2040 
CAGGAGACGG AGAACAAGTA CGAGACCGAC CTGGGCCGAG ACCGGAAAGG CGATGACAAG 2100 
AAGAACATGC TGATCGACAT CATGGACACG TCCCAGCAGA AGTCGGCTGG CCTGCTGATG 2160 
GTGCACACCG TGGACGCCAC TAAGCTGGAC AACGCCCTGC AGGAAGAGGA CGAGAAGGCA 2220 
GAGGTGGAGA TGAAGCCAGA CAGCTCGCCG TCCGAGGTGC CAGAAGGTGTTTCCGAAACC 2280 
GAAGGTGCTT TACAGATCTC CGCTGCCCCC GAGCCCACCA CCGTGCCCGG CAGAACCATC 2340 
GTCGCGGTGG GCTCCATGGA AGAGGCGGTG ATTTTGCCAT TCCGCATCCCTCCTCCCCCT 2400 
CTGGCATCCG TGGACTTGGA TGAGGATTTT ATTTTTACAG AGCCATTGCC TCCTCCCCTG 2460 
GAATTTGCAA ATAGTTTTGA TATCCCCGAT GACCGGGCAG CTTCTGTCCC GGCTCTCTCA 2520 
GACTTAGTGA AGCAGAAGAA AAGCGACACC CCTCAGTCCC CTTCGTTGAA CTCCAGCCAA 2580 
CCAACCAACT CTGCAGACAG CAAGA AGCCA GCCAGTCTTT CAAACTGTCT GCCTGCCTCA 2640 
TTCCTGCCAC CCCCTGAAAG CTTTGACGCC GTCGCCGACT CTGGGATCGA GGAGGTGGAC 2700 
AGCCGGAGTA GCAGCGACCA CCACCTCGAG ACGACCAGCA CTATCTCCAC CGTGTCTAGC 2760 
ATCTCCACCC TGTCTTCCGA AGGTGGAGAG AATGTGGACA CCTGCACAGT CTATGCAGAT 2820 
GGGCAAGC AT TTATGGTTGA CAAACCCCCA GTACCTCCTA AGCCAAAAAT GAAGCCCATC 2880 
ATTCACA AAA GCAATGCACT TTATCAAGAC GCGCTCGTGG AAGAAGATGT AGATAGCTTT 2940 
GTTATCCCCC CGCCCGCTCC CCCGCCCCCG CCGGGCAGTG CCCAGCCTGG GATGGCCAAG 3000 
GTTCTCC AGC CAAGGACCTC CAAGTTGTGG GGCGACGTCA CAGAGATCAA AAGCCCGATT 3060 
CTCTCAGGCC CAAAGGCAAA CGTTATTAGT GAATTGAACT CTATCCTACA GCAAATGAAC 3120 
CG AGAGAAAT TGGCAAAGCC GGGGGAAGGA CTGGATTCAC CAATGGGAGC CAAGTCCGCC 3180 
AGCCTCGCTC CAAGAAGCCC GGAGATCATG AGCACCATCT CAGGTACACG GAGCACGACG 3240 
GTCACCTTCA CTGTTCGCCC CGGCACCTCC CAGCCCATCA CCCTGCAGAG CCGGCCCCCC 3300 
GACTATGAAA GCAGGACCTC AGGAACAAGA CGTGCCCCAA GCCCTGTGGT CTCGCCAACA 3360 
GAGATGAACA AAGAGACCCT GCCCGCCCCC CTGTCTGCTG CCACCGGCTC TCCTTCTCCC 3420 
GCTCTCTCAG ATGTCTTTAG CCTTCCAAGC CAGCCCCCTT CTGGGG ATCT ATTTGGCTTG 3480 
AACCCAGCGG GACGCAGTAG GTCGCCATCC CCCTCGATAC TGCAACAGCC AATCTCAAAT 3540 
AAGCCTTTTA CAACTAAACC TGTCCACCTG TGGACTAAAC CAGATGTGGC CGATTGGCTG 3600 
GAAAGTCTAA ACTTGGGTGA ACATAAAGAG GCCTTCATGG ACAATGAGAT CGATGGCAGT 3660 
CACTTACCAA ACCTGCAGAA GGAGGACCTC ATCGATCTTG GGGTAACTCG AGTCGGGCAC 3720 
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AGAATGAACA TAGAAAGGGC TTTGAAACAG CTGCTGGACA GATAAGGACG GCTGCTCTCC 3780 
ACCTCGCAGA CTGCTCTTGT TATAAGTAGA GATGGGCTCG TGCTGAAACA TCTGAATGCC 3840 
AAGCGAAGTC TGTGAGCATC AACCCCACTC CATGGGTTTG TCTCCTGGTA CCCAAAGAAA 3900 
TACTGAGTTG TGTCCACAAC ATGGCTGGGT CTTCAGACCC CTGGCTCACC ATGTGGGTGT 3960 
5 CTTGGGCAGT TTCTATCACA CATGGGACAA GGGGAGGGAG TTTTTCTAAC ATGGAAAAAG 4020 
ATTCCCAGCC TGCCGCCCAG CATGCAGGTG GCCTCGCTTT GCCGGGTCCG AGAGGCTCCC 4080 
CGTCAATTTT GCACGGGATC CTAGCTCTTG TAGGCAGACA CCAGTGCACT CTAGATACCT 4140 
CCTGAGACCT CCGTCCTCTG CTTTCCGGGC AGCTCTCACC ACCCCAGGCC CCGGCATGAG 4200 
GCCTTTCCTC AGTCCTGTGG CCTCTCAGAG GACACCTGAT GCTCACCTGC CCCTCTTTCT 4260 

1 0 CCTGCACTTG GCTTGCAGTG AGATGCTCCC AG ATGCATCT GTCCAGTGCC CCATCATGGG 4320 
CCTGAAAGGC AGAGAAACTT TTTCCTACAC AGATTCTTTT CCCCATCTCC TCCTGTGGTT 4380 
TGCATCCATG GCTCTTTGGC CATGAGGTTC CTGGCAGTGC TGGGAGTTTG GATGGGATCG 4440 
TGCCCAGCTT TGCTTAGCTT TCTTTATTTC TGCAAATCTG TTAGCATAAT TCCAAGGTGG 4500 

, _ CCAAGCAGAT GTCACATGGA GTTAGTCAAA GCACAAAGTC ACGATTCCAC AATGGAGGGG 4560 

1 5 AGACCTGGCC AAGGGAGCCA GCC AGCGTGC AACTGCCCAA GCTCCAGGTC TCCAGG ACA A 4620 
GAGCAGTTGT CTGCCATGAG CACCCATCCA GGATGGAGAA TAAGGGCTTC TCTGCCTCTC 4680 
AGAATTCTTT TTAATTGAAG ATGTCTTGAG CTCTGCAAAG ATCAGAGCAG GTGAGCATCC 4740 
ACTTTGACAT GAAGGACAAG AAGACGCATG GCTCATGGCG GGCACATGCG GCTGCCAGTG 4800 
AGACAGCGTC TCCTCTGGGA GCTGGGCGGG CACAGCATCC TCAGTTCTGT GCCCAGCCAA 4860 

20 GGGTGAGCAT CTCTGCTGAG ACAGTCCTTT TGCTCTCGGA GGCCAGGG AA G ATGGTACTT 4920 
. AGAGGCmT CCCCTATCGC TCTGGGTGTC TAGGAATCCC ACCAGCTTGT CTTAACAGTA 4980 
CAACAGCTTC TTTGAGGACC CAGTGGGTAT GGAGTATAGA CAGAACCCAG GGTTGAGAAC 5040 
AGAAGGTGGG CGGCAGGATC AGAGTGAAAG CAGAGGCGTG AGGAGAGGAA AGCAGGGAGG 5100 

_ _ TCTCCTGGGC TGCCAGGTCA GCCTCTCTGG CAAGGCTTTC TTGAGCCCCG CCCCTTTCTr 5160 

25 TCCCCGGAGT CCCTCCACCC CATAACAATA CCTCGAATTT CCAAAAGAGG TCACCAGATG 5220 
CACATGGGCC GCAAAACACA CAGTCAGGCT TCCAGCACAT TCTCCCCCAT TTGGAGGATA 5280 
CTCGAATGTC AGGTTTTTGG TTTTATTATT ATTTCAGAAC TAGCTCAGCC CATCTCTAAT 5340 
TATAAAACAT GGTTTTGTTT TnTTTTTTC CnTTTTTCT TGATTAGGTC TGG AACAGCT 5400 

_ n CTAGAATGAA CACATAAAAT TTAGCAATTT AAAATCTTTC TTTACTGCAA GTTTAAATAG 5460 

30 TTGTACAGAT AGTTTATAAG CACAATATTT TAAGAAAAAA AAGTGGCTGG TCTACTAGGC 5520 
AGCCTTTGTG CCACTTCAGT GCTAGAAAGT TAAAG AAAAA AAAACTTTTG TGATTTAATA 5580 
ATACTATTTC TGTGGAATAA TTATAAAAGT ATGACCTTTT TAAATCAACC TTATTTGGAT 5640 
GCATCTGAAC CAGCAGAGCT GTGTTATATT TTCTATCTTT GCTAGAACTT CGTCATTGAA 5700 
GGACAATTTC TTCAAAGTGG TTACAATTCA TAATGCAGCA GTTTCTCCAA AAACAAAAAC 5760 

35 AAAACACACA CCAC ACACAC GCGCTTTTCC AGTCACACAC CCCTGATGTT GGAACCAAGT 5820 
TTTTGGACCT TCTGTTCCAA AACCTTTTGC AGGTCAATCT TTGTATTTGA AATGATCCAA 5880 
TCCAACTTGA AGTCAATTGA ATATTAAGGC GCTTTACTTC CGTGTGCTTT CAGTITTTCC 5940 
ATCATGAGAT GAATGAGCAT TACTCTAGAT AAATTTCAAG ACAGGATACT ACAGGTGGCC 6000 
. n TGCTGAGGCT GCCCCATATT TTAGAAAATG TAAAAATGGT GGTTTGGCCA TTAATTTGTC 6060 

40 TTCCATTTGA TGATACCGCA AAATTCCGTG AGTCCATTCC TTTGGCATGG CACTTTCCCT 6120 
GGGCCTACAG TTGGTATTAC CTCTGTGCTC AGTGCCAGGC AAAACACTAG CTCAAAGGAG 6180 
AGTCAAGGAA ACCGCTGGCA GACGATAACC AGTCGAAACT CGTGACTTCG GTTTGTTGAA 6240 
CTTTGGCAGC CAGTTGGTGA GGGCCAGATG TTATTCCCTT TCTTAAAGAT ACTCCAAGCC 6300 
ACATGCCACT AACCACAAGC AAGCTGGCTG CAAGACTAAA GAGCTGATAA CATAGTTTAT 6360 

45 TTTTACACTG TCTTATTATA GAGAAGTAAT AGACCTATCA GAACCTGCAC TGACCAACAA 6420 
ATAAACACAT GTTGCCAAGA TGAATCGGTC TCTATCTCTA TCTGCTTATT TTGGTACTGA 6480 
AAGCAATAGT TCCTCATTCA AATCACCACC CACTGTTCTC CCCCTTTGGG ACATGTTAGG 6540 
ACGAGGCCCT ATTCCATGCC CCTCTTTAAT GGTGGAACAA ATGTTAAACT GCTCATCTAA 6600 
AGATCATGTT GATATTATTC CAGGTTTTAA GATCAACTTT TGTTACATAC TGTAATTTAA 6660 

50 ATAAACTGCA TTTACATGCC TAGTTTCTGT AATATTGTGT ATACAAAACC CAAATCTCTC 6720 
AAAATGTAAA TTATGTATAC CTGCCAAGAT ACCTTTTCCA GGGTGTCTGC GCACATTTTA 6780 
AGTTAATTCA CATAATATAA AAATTACTCA ATGTGACTGT TG ATTTGCTG AACTTTACAT 6840 
ATCACAAAGT GAATTATTTG TGATACTTTA GTTAATAAAA TGGTA AATTT TTTTCTCAGT 6900 
TATTGAACAA GCAAGCATTA TCCAGTTG AT CTGGCAATGA CTTTTTGTGT GTGGGCC ACA 6960 

55 ATATTGATTT TCCCATTAAC AATTTTTTTT TGTTTTTTA A ATACTAATAT GTTTCACACT 7020 
ATAGTTTGTG TAACAACACG TGTTCGCATT ATCTATGTTG CTGTTACTTT TGTGCTTTTA 7080 
TTCTTTTTAG ACTTTATAAA AAAAAAAAAA AGCTCCTGTA ATTTGCACTT TCTCCCAATC 7140 
CTTAAATCTC TTGTATGGCA ACCAAAATTA CTGTAAAAAA ATA AATATAC TATTGCACTA 7200 
AGGTTGTGGT TCTGATTGCA AACAAACAGT GAACACTGTC TGAATTAAAC AAAAAGCTGC 7260 

60 CCGACTTGCA ATCTAATGTA GATTATCTCA GGCATTGTGG CCAGCTCTGC CTCTCTAAAA 7320 
CTGACCAGAA AAATCTCTCT CATCGAGTAA ACAGGCTCCT GTCACTGAGC TAATCTGCCT 7380 
TGGTTCCATT TCCTTATTCT CAATTTATCA ATGGATACGT GCATGTTATT TCAGAATTAT 7440 
GCAAAACGTC AAAATCTGCT TCTGTGACCG CTGCTATAGG CGTGGAGCTG AGGCTCGGCT 7500 
TTTCCTTTTG TTCTGGGTGG AAGCAGCGGT GCCGCGGAGG GCCAGCCAGA TCCGGACCCT 7560 

65 TCCCTTAGGG TCCAGTCTCC CCACACCCCA GCAGGGTGTC TTCTAGCCAT A AGGCCA AGG 7620 
GAGTGGCAGA ACTGGGCCGC CTCTCTGGTT GACAAGCAAA CCACATGCTA AGGCTTGGAG 7680 
CAAGAGAGAATTTGTGTCTATTGGCAAAGA ACTAAGCCAG GAAGACATGG GCCATCCCTC 7740 
CGCTTTAGGG AAGCATATTT TAAACCTAAA CGTTGAACTT CTTCTTTGGC CTCACCAGTG 7800 

_ _ AAAACTTGTT GTCTTTAGTT CCTAAAGTTT CTTCTACTTT GGCACATTCC CCAGTTGAGC 7860 

70 AGCAGCCTCT ATGCTTCCAC GTTCAGGAAA AATTCCAGTC CTCATATCTT TTGTAGTTCA 7920 
CCCTCAAGCT CTCCCGCTTC ACCATCCAAT AGTTTCTCCC AAACCTTGGC ACCCCCCTAG 7980 
ACTTTGCTTC CAATGGTTTC TTCCAGACCA CTTTTCCTAG ATGAATATAT TCGTTTACCT 8040 
TACTAGGAAA ATTATTGGAA G ATTTTTTCT TTTACTTGAA ATTGGAGGCA TTTTAATAAC 8100 

__ TGGCGAACTG GAATGTGTTT CTGTATTTGT AGACAACCAT GTACCCATGC A AGTAGGTGA 8160 

75 ACATTCCACA GTGGCTGGGT GACCACAGCA GCTGCATGCA GACAGGACTG CCCGTGCTTT 8220 
GTGGGGAATC AGAGAATTTC C AAACTTGTT TCTC AGACTT CCGCAG ATCT CATC ACTTTG 8280 
ATTTCTAATC CATGCTGTAT TGGTGATTTT GTTTATCGTT CCTGTAACTT GTTCTACATT 8340 
CCACAGTCTT TACCGTTTTA TGTTCAAAAT TACAACAATC CCTGTCCATT GATTCCACTC 8400 
TGGAACTCTT TGTTCATGCC AATTTTGAAA TTTTAATACG AGCCTTCAAA TAAACACAGA 8460 
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AAAGAAAAAA AAAAAAAAAA 
SEq ID NO:165PEZ6 Protein sequence: 



1 11 21 31 41 SI 
I I I I I I 

MMMNVPGGGA AAVMMTGYNN GRCPRNSLYS DCI1EEKTW LQKKDNEGFG FVLRGAKADT 60 
PIEEFTPTPA FPALQYLES V DEGGV AWQAG LRTGDFLIEV NNENV VKVGH RQ WNMIRQG 1 20 
GNHLVLKVVT VTRNLDPDDT ARKKAPPPPK RAPTTALTLR SKSMTSELEE LVDKDKPEEI ISO 
VPASKPSRAA ENMAVEPRVA TIKQRPSSRC FPAGSDMNS V YERQGIAVMT PTVPGSPKAP 240 
FLGIPRGTMR RQKSIDSR1F LSGITEEERQ FLAPPMLKFT RSLSMPDTSE DIPPPPQS VP 300 
PSPPPPSPTT YNCPKSPTPR VYGTIKPAFN QNSAAKVSPA TRSDTVATMM REKGMYFRRE 360 
LDRYSLDSED LYSRNAGPQA NFRNKRGQMP ENPYSEVGKI ASKAVYVPAK PARRKGMLVK 420 
QSNVEDSPEK TCSIPIPTII VKEPSTSSSG KSSQGSSMEI DPQAPEPPSQ LRPDESLTVS 480 
SPFAAAIAGA VRDREKRLEA RRNSPAFLST DLGDEDVGLG PPAPRTRPSM FPEEGDFADE 540 
DSAEQLSSPM PSATPREPEN HFVGGAEASA PGEAGRPLNS TSKAQGPESS PAVPSASSGT 600 
AGPGNYVHPL TGRLLDPSSP LALALSARDR AMKESQQGPK GEAPKADLNK PLYIDTKMRP 660 
SLDAGFPTVT RQNTRGPLRR QF.TF.NKYETD LGRDRKGDDK KNMLIDIMDT SQQKSAGLLM 720 
VHTVDATKLD NALQEEDEKA EVEMKPDSSP SEVPEGVSET EGALQISAAP EPTTVPGRT1 780 
VAVGSMEEAV ILPFRIPPPP LASVDLDEDF IFTEPLPPPL EFANSFDIPD DRAASVPALS 840 
DLVKQKKSDT PQSPSLNSSQ PTNSADSKKP ASLSNCLPAS FLPPPESFDA VADSGIEEVD 900 
SRSSSDHHLE TTSTISTVSS ISTLSSEGGE NVDTCTVYAD GQAFMVDKPP VPPKPKMKPI 960 
IHKSNALYQD ALVEEDVDSF VIPPPAPPPP PGS AQPGMAK VLQPRTSKLW GDVTEIKSPI 1020 
LSGPKANVIS ELNSIIjQQMN REKLAKPGEG LDSPMGAKSA SLAPRSPEIM STISGTRSTT 1080 
VTFTVRPGTS QPITLQSRPP DYESRTSGTR RAPSPVVSPT EMNKETLPAP LSAATASPSP 1 140 
ALSDVFSLPS QPPSGDLFGL NPAGRSRSPS PS1LQQPISN KPFTTKPVHL WTKPD VADWL 1200 
ESLNLGEHKE AFMDNEIDGS HLPNLQKEDL IDLGVTRVGH RMNffiRALKQ LLDR 

SEQ ID NO.-166 PEZ4 DNA SEQUENCE 

Nucleic Acid Accession*: NM.0O0024 

Coding sequence: 220-1 4S1 (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 
I I I I I I 

ACTGCGAAGC GGCTTCTTCA GAGCACGGGC TGGAACTGGC AGGCACCGCG AGCCCCTAGC 60 
ACCCGACAAG CTGAGTGTGC AGG ACGAGTC CCCACCACAC CCACACCACA GCCGCTGAAT 120 
GAGGCTTCCA GGCGTCCGCT CGCGGCCCGC AGAGCCCCGC CGTGGGTCCG CCCGCTGAGG 1 80 
CGCCCCCAGC CAGTGCGCTT ACCTGCCAGA CTGCGCGCCA_TG_GGGCAACC CGGGAACGGC 240 
AGCGCCTTCT TGCTGGCACC CAATAGAAGC CATGCGCCGG ACCACGACGT CACGCAGCAA 300 
AGGGACGAGG TGTGGGTGGT GGGCATGGGC ATCGTCATGT CTCTCATCGT CCTGGCCATC 360 
GTGTTTGGCA ATGTGCTGGT CATCACAGCC ATTGCCAAGT TCGAGCGTCT GCAG ACGGTC 420 
ACCAACTACT TCATCACTTC ACTGGCCTGT GCTGATCTGG TCATGGGCCT GGCAGTGGTG 480 
CCCTTTGGGG CCGCCCATAT TCTTATGAAA ATGTGGACTT TrGGCAACTT CTGGTGCGAG 540 
TTTTGGACTT CCATTGATGT GCTGTGCGTC ACGGCCAGCA TTGAGACCCT GTGCGTGATC 600 
GCAGTGGATC GCTACTTTGC CATTACTTCA CCTTTCAAGT ACCAGAGCCT GCTGACCAAG 660 
AATAAGGCCC GGGTGATCAT TCTGATGGTG TGGATTGTGT CAGGCCTTAC CTCCTTCTTG 720 
CCCATTCAGA TGCACTGGTA CCGGGCCACC CACCAGGAAG CCATCAACTG CTATGCCAAT 780 
GAGACCTGCT GTGACTTCTT CACGAACCAA GCCTATGCCA TTGCCTCTTC CATCGTGTCC 840 
TrCTACGTTC CCCTGGTGAT CATGGTCTTC GTCTACTCCA GGGTCTTTCA GGAGGCCAAA 900 
AGGCAGCTCC AGAAGATTGA CAAATCTGAG GGCCGCTTCC ATGTCCAGAA CCTTAGCCAG 960 
GTGGAGCAGG ATGGGCGGAC GGGGCATGGA CTCCGCAGAT CTTCCAAGTT CTGCTTGAAG 1020 
GAGCACAAAG CCCTCAAGAC GTTAGGCATC ATCATGGGCA CmCACCCT CTGCTGGCTG 1080 
CCCTTCTTCA TCGTTAACAT TGTGCATGTG ATCCAGGATA ACCTCATCCG TAAGGAAGTT 1 140 
TACATCCTCC TAA ATTGGAT AGGCTATGTC AATTCTGGTT TCAATCCCCT TATCTACTGC 1200 
CGGAGCCCAG ATTTCAGGAT TGCCTTCCAG GAGCTTCTGT GCCTGCGCAG GTCTTCTTTG 1260 
AAGGCCTATG GGAATGGCTA CTCCAGCAAC GGCAACACAG GGGAGCAGAG TGGATATCAC 1320 
GTGGAACAGG AGAAAGAAAA TAAACTGCTG TGTGAAGACC TCCCAGGCAC GGAAGACTTT 1380 
GTGGGCCATC AAGGTACTGT GCCTAGCGAT AACATTGATT CACAAGGGAG GAATTGTAGT 1440 
ACAAATGACT CACTGCTGJAAAGCAGTTTT TCTACTTTTA AAGACCCCCC CCCCCCCAAC 1500 
AGAACACTAA ACAGACTATT TAACITGAGG GTAATAAACT TAGAATAAAA TTGTAAAAAT 1560 
TGTATAG AGA TATGCAGAAG OA AGGGCATC CTTCTGCCTT TTTTATnTT TTAAGCTGTA 1620 
AAAAGAGAGA AAACTTATTT GAGTG ATTAT TTGTTATTTG TACAGTTCAG TTCCTCTTTG 1680 
CATGGAATTT GTAAGTTTAT GTCTAAAGAG CTTTAGTCCT AGAGGACCTG AGTCTGCTAT 1740 
ATTTTCATGA CTITTCCATG TATCTACCTC ACTATTCAAG TATTAGGGGT AATATATTGC 1 800 
TGCTGGTAAT TTGTATCTGA AGGAGATTTT CCTTCCTACA CCCTTGGACT TGAGGATTTT 1 860 
GAGTATCTCG GACCTTTCAG CTGTGAACAT GGACTCTTCC CCCACTCCTC TTATTTGCTC 1920 
ACACGGGGTA TTTTAGGCAG GGATTTGAGG AGCAGCTTCA GTTGTnTCC CGAGCAAAGG 1980 
TCTAAAGTTT ACAGTAAATA AAATGTTTGA CCATG 
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MGQPGNGS AF LLAPNRSHAP DHDVTQQRDE VWVVGMGIVM SLIVLAIVFG NVLVITAIAK 60 
FERLQTVTNY FITSLACADL VMGLAVVPFG AAH1LMKMWT FGNFWCEFWT SIDVLCVTAS 120 
ETLCVIAVD RYFAITSPFK YQSLLTKNKA RVIILMVWIV SGLTSFLPIQ MHWYRATHQE 180 
5 AINCYANETC CDFFTNQAYA IASSI VSFYV PLVIM VFVYS RVFQEAKRQL QKIDKSEGRF 240 
HVQNLSQVEQ DGRTGHGLRR SSKFCLKEHK ALKTLGUMG TFTLCWLPFF IVN1VHVIQD 300 
NLIRKEVYIL LNWIGYVNSG FNPLIYCRSP DFRIAFQELL CLRRSSLKAY GNGYSSNGNT 360 
GEQSGYHVEQ EKENKLLCED LPGTEDFVGH QGTVPSDNID SQGRNCSTND SLL 

10 



SEQ[DN0:168PEZ1 DNA SEQUENCE 

Nucleic Acid Accession #: NMJH4457 
15 Coding sequence: 143-2305 (underlined sequences correspond lo start and stop codons) 



1 11 21 31 41 51 
on I I I I I I 

ZU GAATTCGTTG TTGGGAAGGA CTGGGGAAAC AGCTGTAACA TTTGCCACCC TCAGAAGCTG 60 
CTGGTCCTGT GTCACACCAC CTTAGCCTCT TGATCGAGGA AGATTCTCGC TGAAGTCTGT 120 
TAATTCTACT TTTTGAGTAC TTATGAATAA CCACGTGTCT TCAAAACCAT CTACCATGAA 180 
GCTAAAACAT ACCATCAACC CTATTCTTTT ATATTTTATA CATTTTCTAA TATCACTTTA 240 
TACTATTTTA ACATACATTC CGTTTTATTT TTTCTCCGAG TCAAGACAAG AAA AATCAAA 300 

25 CCGAATTAAA GCAAAGCCTG TAAATTCAAA ACCTGATTCT GCATACAGAT CTGTTAATAG 360 
TTTGGATGGT TTGGCTTCAG TATTATACCC TGGATGTGAT ACTTTAGATA AAGTTTTTAC 420 
ATATGCAAAA AACAAATTTA AGAACAAAAG ACTCTTGGGA ACACGTGAAG TTTTAAATGA 480 
GGAAGATGAA GTACAACCAA ATGGAAAAAT TTTTAAAAAG GTTATTCTTG GACAGTATAA 540 
TTGGCTTTCC TATGAAG ATG TCTTTGTTCG AGCCTTTAAT TTTGGAAATG GATTACAG AT 600 

30 GTTGGGTCAG A AACCAA AGA CCAACATCGC CATCTTCTGT GAGACCAGGG CCGAGTGGAT 660 
GATAGCTGCA CAGGCGTGTT TTATGTATAA TTTTCAGCTT GTTACATTAT ATGCCACTCT 720 
AGGAGGTCCA GCCATTGTTC ATQCATTAAA TGAAACAGAG GTGACCAACA TCATTACTAG 780 
TAAAGAACTC TTACAAACAA AGTTGAAGGA TATAGTTTCT TTGGTCCCAC GCCTGCGGCA 840 
CATCATCACT GTTGATGGAA AGCCACCGAC CTGGTCCGAC TTOCCCAAGG GCATCATTGT 900 

3D GCATACCATGGCTGCAGTGGAGGCCCTGGGAGCCAAGGCCAGCATGGAAAACCAACCTCA 960 
TAGCAAACCA TTGCCCTCAG ATATTGCAGT AATCATGTAC ACAAGTGGAT CCACAGGACT 1020 
TCCAAAGGGA GTCATGATCT CACATAGTAA CATTATTGCT GGTATAACTG GGATGGCAGA 1080 
AAGGATTCCA GAACTAGGAG AGGAAGATGT CTACATTGGA TATTTGCCTC TGGCCCATGT 1 140 
TCTAGAATTA AGTGCTGAGC TTGTCTGTCT TTCTCACGGA TGCCGCATTG GTTACTCTTC 1200 

4U ACCACAGACT TTAGCAGATC AGTCTTCAAA AATTAAAAAA GGAAGCAAAG GGGATACATC 1260 
CATGTTGAAA CCAACACTGA TGGCAGCAGT TCCGGAAATC ATGGATCGGA TCTACAAAAA 1320 
TGTCATGAAT AAAGTCAGTG AAATGAGTAG TTTTCAACGT AATCTGTTTA TTCTGGCCTA 1380 
TAATTACAAA ATGGAACAGA TTTCAAAAGG ACGTAATACT CCACTGTGCG ACAGCTTTGT 1440 
TTTCCGGAAA GTTCGA AGCT TGCTAGGGGG AAATATTCGT CTCCTGTTGT GTGGTGGCGC 1500 

45 TCCACTTTCT GC AACCACGC AGCG ATTCAT GAAC ATCTGT TTCTGCTGTC CTGTTGGTCA 1560 

GGGATACGGG CTCACTG AAT CTGCTGGGGC TGGAACAATT TCCGAAGTGT GGGACTACAA 1620 
TACTGGCAGA GTGGGAGCAC CATTAGTTTG CTGTGAAATC AAATTAAAAA ACTGGGAGGA 1680 
AGGTGGATAC TTTAATACTG ATAAGCCACA CCCCAGGGGT GAAATTCTTA TTGGGGGCCA 1740 
AAGTGTGACA ATGGGGTACT ACAAAAATGA AGCAAAAACA AAAGCTGATT TCTCTGAAGA 1800 

50 TGAAAATGGA CAAAGGTGGC TCTGTACTGG GGATATTGGA GAGTTTGAAC CCGATGGATG 1860 
CTTAAAGATT ATTGATCGTA AAAAGGACCT TGTAAAACTA CAGGCAGGGG AATATGTTTC 1920 
TCTTGGGAAA GTAGAGGCAG CTTTGAAGAA TCTTCCACTA GTAGATAACA TTTGTGCATA 1980 
TGCAAACAGT TATCATTCTT ATGTCATTGG ATTTGTTGTG CCAAATCAAA AGGAACTAAC 2040 
TGAACTAGCT CGAAAGAAAG GACTTAAAGG GACTTGGGAG GAGCTGTGTA ACAGTTGTGA 2100 

55 AATGGAAAAT GAGGTACTTA AAGTGCTTTC CGAAGCTGCT ATTTCAGCAA GTCTGGAAAA 2160 
GTTTGAAATT CCAGTAAAAA TTCGTTTGAG TCCTGAACCG TGGACCCCTG AAACTGGTCT 2220 
GGTGACAGAT GCCTTCAAGC TGAAACGCAA AGAGCTTAAA ACACATTACC AGGCGGACAT 2280 
TGAGCGAATG TATGGAAGAA AATAA.TTATT CTCTTCTGGC ATCAGTTTGC TACAGTGAGC 2340 
TCACATCAAA TAGGAAAATA CTTGAAATGC ATGTCTCAAG CTGCAAGGCA AACTCCATTC 2400 

60 CTCATATTAA ACTATTACTT CTCATGACGT CACCATTTTT AACTGACAGG ATTAGTAAAA 2460 
CATTAAGACA GCAAACTTGT GTCTGTCTCT TCTTTCATTT TCCCCGCCAC CAACTTACTT 2520 
TACCACCTAT GACTGTACTT GTCAGTATGA GAATTTTTCT GAATCATATT GGGGAAGCAG 2580 
TGATTTTAAA ACCTCAAGTT TTTAAACATG ATTTATATGT TCTGTATAAT GTTCAGTTTG 2640 
TAACTTTTTA AAAGTTTGGA TGTATAGAGG GATAA ATAGG AAATATAAGA ATTGGTTATT 2700 

65 TGGGGGCTTT TTTACTTACT GTATTTAAAA ATACAAGGGT ATTGATATGA AATTATGTAA 2760 
ATTTCAAATG CTTATGAATC AAATCATTGT TGAACAAAAG ATTTGTTGCT GTGTAATTAT 2820 
TGTCTTGTAT GCATTTGAGA GAAATAAATA TACCCATACT TATGTTTTAA GAAGTTGAGA 2880 
TCTTGTG AAT ATATGCCTGT CAGTGTCTTC TTTATATATT TATTTTTTAT TAG AA AAAAT 2940 
GAAGTTTGGT TGGTGATGCA TGAAACAAAA TAGCAAGAGA GGGTTATAGT TTAATAGTAA 3000 

70 GGGAGATAAC ACAGCATGTG TAGCACCAGT TGATAATTGG TCTCTAGTAG CTTACTGTCA 3060 
AAATGTTCAA TGAAGTCTTC TGTTCATCTG TTGAAACTAG GAAAATACCC AAACTTAAAT 3120 
GGAAGAATTC TGAAAGAGAG GATAGAATTT AAAGAACAAG AGTATATAAA GTTATTCTTT 3180 
GAATATTTCG TTGACTATAT GTACATTGAG TTATCTATAT TTGTAAACAA ATTAGTCATG 3240 
GAAAATTATT CTATTCCAAA GTCTCCTTTT AGTCTAGATA ATCATTATTT CATTTTAAAA 3300 

75 TTAGTGTTTT TCATAGTTTG CACTGATGCG TGTATGGATG TGTGTGAGTC AGTGGTAGCT 3360 
TATTTAAAAA GCACCTTATC CTTTCTCCCA TAACCTTTGT ACACTAAAAA ATGAAAGAAT 3420 
TTAGAATGTA TTTGATG ATA GC ATTCTCAC TAAG AC ACAT GAGAATTTAA CTTTATAACC 3480 
GCGTGAGTTA AGATTTAATT CATAGGTTTT GATGTCATTG TTGAAGTTAT TTGTAATTCA 3540 
GAAACCTTGC TTGTGTGATA CATAGTAAGT CTCTTCATTT ATTACTGCTT GCCTGTTGTT 3600 
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ATATCTGG AT TATCAAAAGC AATAGTGCAC CAATTAAGAT GTGCTCAAAT CAGGACTTAA 3660 
ATCATAGGCA CCACATTTTT CATGTCAGAC TAGTTACTTT GTTGATTCTC AGTTACTGTA 3720 
GGCATCAAAA GGCAAAAATC A 



II 



31 



10 I | I | | | 



51 



MNNHVSSKPS TMKLKHTINP ILLYFIHFLI SLYT1LTYIP FYFFSESRQE KSNRKAKPV 60 
NSKPDSAYRS VNSLDGLASV LYPGCDTLDK VFTYAKNKFK NKRLLGTREV LNEEDEVQPN 120 
GKIFKKVILG QYNWLSYEDV FVRAFNFGNG LQMLGQKPKT NIA1FCETRA EWMIAAQACF 180 
MYNFQLVTLY ATLGGPAIVH ALNETEVTNI 1TSKELLQTK LKDIVSLVPR LRHHTVDGK 240 
PPTWSDFPKG IIVHTMAAVE ALGAKASMEN QPHSKPLPSD IAVMYTSGS TGLPKGVM1S 300 
HSNIIAGITG MAERIPELGE EDVYIGYLPL AHVLELSAEL VCLSHGCRIG YSSPQTLADQ 360 
SSKIKKGSKG DTSMLKFTLM AAVPEMDRI YKNVMNKVSE MSSFQRNLFI LAYNYKMEQI 420 
SKGRNTPLCD SFVFRKVRSL LGGNIRLLLC GGAPLSATTQ RFMNICFCCP VGQGYGLTES 480 
AGAGTISEVW DYNTGRVGAP LVCCE1KLKN WEEGGYFNTD KPHPRGEILI GGQSVTMGYY 540 
KNEAKTKADF SEDENGQRWL CTGDIGEFEP DGCLKIIDRK KDLVKLQAGE YVSLGKVEAA 600 
LKNLPLVDNI CAYANS YHSY VIGFWPNQK ELTELARKKG LKGTWEELCN SCEMENEVLK 660 
' VS LEKFEIPVKI RLSPEPWTPE TGLVTDAFKL KRKELKTHYQ ADD3RMYGRK 

SEQ ID N0:170 PCQ7 DNA SEQUENCE 

none found 

38-1 075(undertlned sequence corresponds to start and stop codon) 



55 
60 



AGCAACGACG CCGGGCAGCG GGAGCGGCGG CCGCGCCATG TGGCTGCTGG GGCCGCTGTG 
CCTGCTGCTG AGCAGCGCCG CGGAGAGCCA GCTGCTCCCC GGGAACAACT TCACCAATGA 
GTGCAACATA CCAGGCAACT TCATGTGCAG CAATGGACGG TGCATCCCGG GCGCCTGGCA 
GTGTGACGGG CTGCCTGACT GCTTCGACAA GAGTGATGAG AAGGAGTGCC CCAAGGCTAA 
GTCGAAATGT GGCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA T 
CGATGGCAGC GATGAAGAGA 
CCACTGCAAG AACGGCCTCT 
TCAAGACAAC AGTGATGAGG 
GTTTGTGACT TCAGAGAACC 
CAGCTCCGTC ATTTTTGTGC 
GAAGCGGAAC AACCTCATGA 
CCGCCTGGTG GTCCTGGACG 
CATCCAGTAT GTGGCCAGCC 
CTACTCCGAG GCCTTGCTGG 
CTCTTCTGAC ACGGAATCTC 
GAGTGCCAAC AGTGCCAGCT 



A GAACCCGGCA GTGGGCAGGT 



CCTGCTGGCA 



CTGGTCTTCC 
CAGCACCCTG 
ACCTACAACG 
GAAGTAGGCT 
GACCTTCCTC 



ACCACCAGCG 
TGCTGCTGTC 
TCAATAATGG 
CCCCACCCTC 



3 AAGACACCAG CCACAGCCCG GGGCAGCCTG 



AACTATCTCT 
TGACATGATC 
CACCCTCATT 



GCATTCCCCT 



AGCACCTGTA A 
CCTCCCCCAG A 
CTTTTCTGTC A 



A ATGTTTCTGT 



AAGAAAACTT 



CTCATTCTGA 
GAGCCCCTCC 
TACACCTGCC 
ACCTGCCCGT 
GTATGTCCCT 



GGTCAGGGTC 



GCTTTGCACA 
TGGACGTGAG 
AAACCATCTA 
GAGCTTTCCT 
CATGAGTTTA 
CTGGCTCTAC 
AGCCAAGGAA 
GTGGCCCACA 
CCCTTAACAC 
ATCACAGGTG 
CACGCTCCTC 
AGGCCTCTCC 



GCTATATTGS ATGCTCAGAA 
TACCTTATAG CATTTGGGGA 
GTCACCCCCC CAAAAAAATT 
ATCCTATTTG ATGCCCCCAG TTCAGCAGAG 
TAACACCCTT CAGCAGTCGC AACGTTATTT 
CCCTGTATAA 
CAGCAGCATA 
TCCAAGTTCT 
AGCCACTTAC 




CCCAGCCTGT 
TTGCAAAGTC 
AGAGCCATGT 
TTCCCAAGGT 




ATTCTGGCTT 
TATCATCAGC 
CAGCTCCTAA 
CTGGTTTCTG 
ACTTGAGITG 
CTTGCTCATT 
CTTTTTACCT 
TCAATACCTC 
CCCAATACCA 
GTAGTTTCTC 
GATCTATTTT 



CTCATCCTAA 



GACTGTCACC 
GCCCAAAGTC 
CATGCAGCCT 
GTGCATTTGG 
CAGCAAGCTC 



TGGTTTTGTG 
CCCAAGAATG 
AATAGGCAGG 
GCCAAGACCC 
CTCCCAGCTG 
TGACCTGGCT 



CTCTGAGACA 



T CTGTAGACTT 



CCCGTGATAA 
TTTGAGGTTA 
CCGTGTATAG 
ACAGGGCCCG 
CCACACTGAC 
CCATTCAGAA 
AAACAGAGCC 
TTCTTTCTTT 



TCTCTATGTT TGTGCTAGTT 
CCTCCCTGCA GGAATAAGGG GTAAAACGTT 
TGATGAGGGG TAAAATGGAA 



CATGGGCAAG 
GAAATGCATT 
TGTATAGGAA 
AAAGGAGATC 
TGTGTGCCAG 
GGAAGCAGAA 



AATGTTCAGT 
TGGCATTCAG 
TGTTACAGAA 



2160 
2220 
2280 
2340 
2400 
2460 
2520 

2640 
2700 
2760 
2820 
2880 
2940 
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AAGCTAGCCA CTGGTATTTT GTTTTGTTTA AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

AACGGAAAGG AACCTAGCTG CCTGTATCTT TCATTOTTAA AATAGCACTT GAGTTATTTT 30S0 

CTGAGTAATC CAATAAAGAA CTTTTGATC3A CAGCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTGAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGCGCAGT 3240 

GAGTTAATCT CACTCGCTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AGATAAGGGA TGCCTACTAA TGCTTTTTTA AAACAAACAG GGACATTTTT ATTATAGATT 33SO 

TGATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAGGGTTT 3420 

TTTTTGGGGG GAGGGGGTTT GTTTTCCAAC TCAAGATGGC ACATTAGTGG CCAGCAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CGCCAACCAG 3540 

AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAGATCCGTG TTTTATTTTA GCATTAAATT 3600 

AGTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTACCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGACG CTCAAAGTTA ATGTAAACTG 3720 

GAAAGGTTGT GTGTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGGTTTTGT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTTTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMAKK 3840 

AAMMAAAAAC AWYWTTGGGG GGGCTTGGGC CTCGGAAAAA GTTTTTAACA CCACTTCGGG 3900 

TGGGGCGGCG GGGCCCACGT AGGTACGGCG ACCACGCGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAACCCTGG CCAAGAAAAA GGTGGCGAGA ATTCTCCACA CCAGAAAAAA ACGCGCCGGG 4020 

GGAAACCGCA GAGTGTTGCG TAAACCACAC CCGAAGAGAG AACTCAGAAG CACACAAGCG 4080 
GGACTCAACC AGGAGGACCC AAGGGAACCC GATAGAGTAC G 



Protein Accession*: 



KNGLCIDKSF ICDGQNNCQD NSDEESCESS QEPGSGQVFV TSENQLVYYP SITYAIIGSS 180 

VIFVLWALL ALVLHHQRKR KNLMTLPVHR LQHPVLLSRL WLDHPHHCN VTYMVNNGIQ 240 

YVASQAEQNA SEVGSPPSYS EALLDQRPAW YDLPPPPYSS DTESLNQADL PPYRSRSGSA 300 
NSASSQAASS LLSVEDTSHS PGQPGPQEGT AEPRDSEPSQ GTEEV 

SEQ ID N0:172 PEL3 DNA SEQUENCE 
KM_005856.1 

57-1535 (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 

I I I I I I 

GTCATATTGA ACATTCCAGA TAGCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGr CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

AGAAAGCACT GTGGATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360 

CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAA.T GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780 

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260 

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGOAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560 

CGTTTTACAA GAAAACAATG GGGCTGGTTT TGCTTCCCCG TGCATGATTT ACTCTTAGAG 1620 

A TTTTTATTAA ACAGTGAACT TGTCTGGCTT TGGCACTCTC 1680 

G TGCAGGCTGC AGTGGCTCCC CTGCCCAGCC TGCTCTCCCT AACCCCTTGT 1740 

CCGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1800 

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860 

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACG1GGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160 
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GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 
ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTAT 
CTGAGTTCAA AGCCATCTT 



&CATTGAT GGTGGTGACC 2460 



Protein Accession*: 



65 
70 



21 



31 



I 



MALNSGSPPA IGPYYENHGY QPENPYPAQP TWPTVYEVH PAQYYPSPVP QYAPRVLTQA 
SNPWCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSG1EC 
DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRKSWHP VCQDDWNENY 
GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAWSLR 
CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 
PLNNPWHWTA FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 
VKFVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 
TPAMICAGFL QGNVDSCQGD SGGPLVTSNN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 
TDWIYRQMKA NG 



Nucleic Acid Accession*: 



SEQ ID N0:174 PBJ4 DNA SEQUENCE 




1 30-1 086 (underlined sequences correspond to start and stop codons) 



A TGA TGGTGGA TCCCAATGGC 



GTCACAGTTC 

AATGAATCCA GTGCTACATA CTTCATCCTA 
CCTTCCCATT GTGCTCCCTC 
ATCATCTACA TTGTGCGGAC TGAGCACAGC 
CTGCATGAGC CCATGTATAT ATTTCTTTGC ATGCTTTCAG GCATTGACAT CCTCATCTCC 
TTCTGGTTCA ATTCCACTAC CATCCAGTTT 



ACCTCATCCA 
GATGCTTGTC 
CTGCTGGCCA 



CTGATGGCAC 
TCCCATTCCT 
AATGTCGTCT 



TGCTACAGAT GTTTGCCATC C 
TGGCTTTTGA CCGCTATGTG G 
TGCCTCGTGT CACCAAAATT G 
CCCTTCCTGT CTTCATCAAG C 
ACTGCCTACA CCAAGATGTC A 
ATGGCCTTAT CGTCATCATC T 



ATTGGATTGT 



CCCTAGGTGT 
GTTAACATTT 
ATCCTTCAAA 



GCACTTGCGT CTCTCATGTG 
CCATGGTGCA TCGCTTTAGC 
TCTATCTGCT GGTTCCTCCT 
1MCGACAGCG CATCCTTCGA 
CAGTGATCAA ACTTCTTTTC 
TGGAAGACAG TATTCAGAAA 
TATGAAACTG GTTGGGGAAT 



AAAAATTTCC 



GCCTGGACTC 
TGACACGTGA 
TCATATTCTA 
ACTCTCCACT 
CAATTGTCTA 
TGGCCACACA 
CCTCTGATTC 
TTAATAAAAA 



CCAAATCTAA ACTG 



A TGGTACATCT AGAGAACATT T 
A ATGAGATAAT C 
- ACATTGGATC T 



C CTAAGCACAG 



ACAGCATTCT 
CAAAGGAAAA 
CCTCTTCAGA 



AGAGTACATT 
ATGGACCCTG 
TTAGTACCCT 
GGGGTCATAC 



ACTAGCACTT AAGGGGAAGA TTGGAAGTAA 
TACCTACGTT AATGAAAGTT GACACACTGT TCTGAGAGTT 
- TTAATTTTCT TATCAACCCT TTAATTAGGC 
ATGGGAAAAT TGATGTTCAG TGGGGATCAG 
ATTAAAAAAA AAAGACTTCA TGCCCAATCT 
CAACAGGGTA GTGGGTTAGA GATTTCCAGA 
TTCTTCTCAC 
AATCCCACTA 



TTAGTACCCT CATTSTAGCC 



TGATAGGCAG 
TGAAGATAAC 
ACCATGCTTT 
ATCTGACTTA 
ATAGGTTTCA 
TACTAAAACA 
CCTGATATGG 
AATGCCTATT 



GTC TTGGAAG 
CATAGGGCTT 
TGAGGTTAGG 
ATTGGCCTTT 
ATTTGGGGCT 
GGCATGGGAA 
TCTTCAACAG 
TGTGATCATA 



ATTATGGAAG 



GAGCCACCAG 



TAATACTTGT 
CATCTCTGTT 
GAACATAATA 
GCTGAACACA 



TATGTGGTAA 
CATGCTTTCA 
ATTTGCTGCT 



CAGTCTTAAC 
GTTTCATTTT 
TCCCCTTTTG 
GGACTGTAAG 
TGCTCTTTGC 
TTGACACCGG 



AAGTATGGAA 
AAAGTGAGGG 
ACTTTGAGAC 
GGGGCTATTA 



AGCCTTGAAA 
TTCACAGCAT 
AAAGATATTA 
TGAATTAAAT 
CATATGATGT 
GTCTTACATT 
GAATTTCCTG 
TCCAATTGCC 
ATTCTTATTC 
TAGGTGTTTC 



CTTTTTCAAT 
TAATGGATAT 
CCCATGAGGG 
TCATCATTGA 
TTATTTTTCA 
CTTCTTTGAG 
3 ATGTGCAATT T 
A TTAAATGTGA C 



AATCTTCAGG 
CGGGAAAGCA 
CCAAGGGTTA 
AAATTACATA 
CCTCAGGTTC 
CATATTTGGA 
CACTGTTTAT 
ATCCCCCAGC 
TCAAACCTGA 



2700 
2760 
2820 
2880 
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TATGTGTTAC ACAGAGTTAA TTAACCNGAA AGGCCTGGNA A 



5 

10 



30 
35 



65 
70 
75 



1 U 21 31 41 51 

MVDPNGNESS ATYFILIGLP GLEEAQFVJLA FPLCSLYLIA VLGNLTI 

MYIFLCMLSG IDILISTSSM PKMLAIFWFN STTIQFDACL LQMFAIHSLS GMESTVLLAM 120 

AFDRYVAICH PLRHATVLTL PRVTKIGVAA WRGAALMAP LPVFIKQLPF CRSNILSHSY 180 

CLHQDVMKLA CDDIRVNWY GLIVIISAIG LDSLLISFSY LLILKTVLGL TREAQAKAFG 240 

TCVSHVCAVF IFYVPFIGLS MVHRFSKRRD SPLPVILANI YLLVPPVLNP IVYGVKTKBI 300 
RQRILRLFHV ATHASEP 

SEQ ID NO:176 PM72 DNA SEQUENCE 
Nucleic Acid Accession* NMJ04624.1 

Coding sequence: 57-1544 (underlined sequences correspond to start and stop codons) 

TCGGAGCCTG CGGAGGGTGG TGGTGGTGGT GGTGGTGGCC CTCGCCCGCC TCACTCATGC 60 

CTCCTCCTCC TCTGCTCTCG CTCAGGCGCC TCGGTGGCGG TTGGTCGGCG GTTACGCGGC 120 

TGGTGGTCGC GGCGGCCGGG GCTCGCTCTC GGGGAGGCCG GGGCGGATCT CGCGGCGCAG 180 

GCGGCGGCGG CCGAGGTGGG GTCGCGCGGC GGAGGCGGCT CGAGCTTCGT GCTGCGCGCT 240 

CGCTCTTGGG CTCCTCGCTG CAGGAGGAGT GTGACTATGT GCAGATGATC GAGGTGCAGC 300 

ACAAGCAGTG CCTGGAGGAG GCCCAGCTGG AGAATGAGAC AATAGGCTGC AGCAAGATGT 3S0 

GGGACAACCT CACCTGCTGG CCAGCCACCC CTCGGGGCCA GGTAGTTGTC TTGGCCTGTC 420 

CCCTCATCTT CAAGCTCTTC TCCTCCATTC AAGGCCGCAA TGTAAGCCGC AGCTGCACCG 480 

ACGAAGGCTG GACGCACCTG GAGCCTGGCC CGTACCCCAT TGCCTGTGGT TTGGATGACA 540 

AGGCAGCGAG TTTGGATGAG CAGCAGACCA TGTTCTACGG TTCTGTGAAG ACCGGCTACA 500 
CCATTGGCTA CGGCCTGTCC CTCGCCACCC TTCTGGTCGC CACAGCTATC C 




TTAAGCCTGA AGTGAAGATG GTCTTTGAGC T 

TGGCTATCCT CTACTGCTTC CTCAATGGTG AGGTGCAGGC GGAGCTGAGG CGGAAGTGGC 1380 

GGCGCTGGCA CCTGCAGGGC GTCCTGGGCT GGAACCCCAA ATACCGGCAC CCGTCGGGAG 1440 

GCAGCAACGG CGCCACGTGC AGCACGCAGG TTTCCATGCT GACCCGCGTC AGCCCAGGTG 1500 

CCCGCCGCTC CTCCAGCTTC CAAGCCGAAG TCTCCCTGGT CTGACCACCA GGATCCCAGC 1560 

CCAAGCGGCC CCTCCCGCCC CTTCCCACTC GCAGCAGACG CCGGGGACAG AGGCCTGCCC 1520 

GGGCGCGCCA GCCCCGGCCC TGGGCTCGGA GGCTGCCCCC GGCCCCCTGG TCTCTGGTCC 1580 

GGACACTCCT AGAGAACGCA GCCCTAGAGC CTGCCTGGAG CGTTTCTAGC AAGTGAGAGA 1740 

GATGGGAGCT CCTCTCCTGG AGGATGCAGG TGGAACTCAG TCATTAGACT CCTCCTCCAA 1800 

AGGCCCCCTA CGCCAATCAA GGGCAAAAAG TCTACATACT TTCATCCTGA CTCTGCCCCC 1860 

TGCTGGCTCT TCTGCCCAAT TGGAGGAAAG CAACCGGTGG ATCCTGAAAC AACACTGGTG 1920 

TGACCTGAGG GCAGAAAGGT TCTGCCCGGG AAGGTCACCA GCACCAACAC CACGGTAGTG 1980 

CCTGAAATTT CACCATTGCT GTCAAGTTCC TTTGGGTTAA GCATTACCAC TCAGGCATTT 2040 

GACTGAAGAT GCAGCTCACT ACCCTATTCT CTCTTTACGC TTAGTTATCA GCTTTTTAAA 2100 

GTGGGTTATT CTGGAGTTTT TGTTTGGAGA GCACACCTAT CTTAGTGGTT CCCCACCGAA 2160 

GTGGACTGGC CCCTGGGTCA GTCTGGTGGG AGGACGGTGC AACCCAAGGA CTGAGGGACT 2220 

CTGAAGCCTC TGGGAAATGA GAAGGCAGCC ACCAGCGAAT GCTAGGTCTC GGACTAAGCC 2280 

TACCTGCTCT CCAAGTCTCA GTGGCTTCAT CTGTCAAGTG GGACTCTGTC ACACCAGCCA 2340 

TTCTTATCTC TCTGTGCTGT GGAAGCAACA GGAATCAAGA GACTGCCCTC CTTGTCCACC 2400 

CACCTATGTG CCAACTGTTG TAACTAGGCT CAGAGATGTG CACCCATGGG CTCTGACAGA 2460 

AAGCAGATCC TCACCCTGCT ACACATACAG GATTTGAACT CAGATCTGTC TGATAGGAAT 2520 

GTGAAAGCAC GGACTCTTAC TGCTAACTTT TGTGTATCGT AACCAGCCAG ATCCTCTTGG 2580 

TTATTTGTTT ACCACTTGTA TTATTAATGC CATTATCCCT GAATTCCCCT TGCCACCCCA 2640 

CCCTCCCTGG AGTGTGGCTG AGGAGGCCTC CATCTCATGT ATCATCTGGA TAGGAGCCTG 2700 

CTGGTCACAG CCTCCTCTGT CTGCCCTTCA CCCCAGTGGC CACTCAGCTT CCTACCCACA 2760 

CCTCTGCCAG AAGATCCCCT CAGGACTGCA ACAGGCTTGT GCAACAATAA ATGTTGGCTT 2820 
GGAAAAAAAA AAAA 

SEQ ID NO:177 PM72 Protein sequence: 



V QHKQCLEEAQ LEMETIGCSK MWDNLTCWPA TPRGQVWLA 
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CPLIFKLFSS IQGRNVSRSC TDEGWTHLEP GPYPIACGLD DKAASLDEQQ TMFYGSVKTG 
YTIGYGLSLA TLLVATAILS LFRKLHCTRN YIHMHLFISF ILRAAAVFIK DLALFDSGES 
DQCSEGSVGC KAAKVFFQYC VMANFFWLLV EGLYLYTLLA VSFFSERKYF W 
PSTFTMVWTI ARIHFEDYGL LRCWDTINSS LWWIIKGPIL TSILVNFILF I 

LRPPDIRKSD SSPYSRLARS TLLLIPLFGV HYIMFAFFPD NFKPEVKMVF ELWGSFQGF 4 

WAILYCFLN GEVQAELRRK WRRWHLQGVL GWNPKYRHPS GGSNGATCST QVSMLTRVSP 4 
GARRSSSFQA EVSLV 

SEQ ID NO:178 BFF8 DNASEQUENCE 
70 (underlined sequences correspond lo start and slop codons) 

1 11 21 31 41 

I I I I I 

ATGAGCGGTO CGGGGGTC3GC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC C 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA G 
CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT G 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA T 

GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 

ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 

GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 

AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 

GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 

CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 

G CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

,CAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 

GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 

GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960 

TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 

CTGTTCTGGG CAAAGTGTGG CC CAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 

GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 

CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGC TCCCTTG 1260 

GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 

GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 

TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 

TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 

CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1S20 

GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 

AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 

ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 

ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1860 

CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATT1CCCCAA GGTCTCCACC 1920 

AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 

CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTT2A 



1 11 21 31 41 51 

1 I I I I I 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPO LRQSDPQKRN LDLEKSLQFL 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 
GGTQDGEPLO TVLAHLAALA PVCOPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 
HLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 
AHFPLSLGLG LTSGGHLTGG WEQPGNIAAG AVPRALPSQG DHEKGVEGGP FPSRCGNSSE 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGICCSHCPK PSCFPDGPSG NHLSRASAPL 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 
SVKSISNSAN SQGKARPQPG SFHKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK ! 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ i 
TQELRHLKSL LEGSQKPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA i 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

SEQ ID NOM80 BCR4 DNA SEQUENCE 
Nucleic Acid Accession*: NM_012319.2 

Coding sequence: 138-2 



A CAACGAGGCC 120 
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GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

5 TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAG TCAGACCATG 480 

AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 

CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

10 GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTC 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATO AAAGAGCCGG GTGAGCCGGC 840 

TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 

GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 

15 GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTCA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 

CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 

TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

20 TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 

TTCTTCTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 

AGAAGAAACC TGAAAATGAT GATGATGTCG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

25 CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 

AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 

TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 

30 CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 

35 GAATTTTCAT TGGTCATTAT GCTGAAAATG TCTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTIACA GAATGCTGGG ATGCTTTTGG 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

40 AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 

TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 

45 TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 

50 AAATATATTT AATGAATTCA AGCAATATAC AC1TGACCAA GAAATTGGAA TTTCAAAATG 3120 

TATTGCCAAG TTATATATCA CCAAAAGCTG TATOACTGGA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 

55 GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATCTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 



SEQ ID N0:181 BCR4 PROTEIN SEQUENCE 
60 Protein Accession*: NP.036451 



1 11 21 31 41 

I I I I I 

05 MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI 
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE 
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLPPKDVSS STPPSVTSKS 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL 

70 RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN 
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ 
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE 
HSHFHDTLGQ SDDL1HHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA 

75 MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA 
HALSAHLAYL GMATGIF1GH YAENVSMWI F ALTAGLFMYV ALVDMVPEML 
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 



51 
I 

STRQYHLQQL 60 

HASGRRNVKD 180 

RVSRLAGRKT 240 

CPAIINQIDA 300 

RVFFKFLLSF 360 

NIEESAYFDS 420 

SKYESQLSTN 480 

YVPRGCKNKC 540 

GVATLAWMVI 600 

GMTVKQAVLY 660 

HMDASDHGCS 720 
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SEQ ID NO:182 BCY2 DNA sequence 

Nucleic Acid Accession* NM.O01203 

Coding sequence: 274-1 782 (underlined sequences correspond to start and stop codons) 

5 1 11 21 31 41 51 
I I I I I I 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCOCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 

1 0 CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 
GACGG ATATT GTTTCACGAT GATAGAAG AG G ATG ACTCTG GGTTGCCTGT GGTCACTTCT 480 

1 5 GGTTGCCTAG GACTAGAAGG CTC AG ATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 
TATAAAAGAC A AGA AACCAG ACCTCG ATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 

20 ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTCG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 
ACAGTGTTGA TGAGGCATG A AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

25 GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 
CATCGAGATC TGAAAAGTA A AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1 320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 

30 ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 
AATCACTTCC AGTCTTACAT CATGGCTG AC ATGTATAGTT TTGGCCTCAT CCTTTGGG AG 1500 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 

35 ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGGCA AAATGTCAGA GTCCCAGGAC ATTAAACTCTGATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 

40 TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 

Pra?J°Aweas?on ^ P "" eln SeaUen ° e np_001194 

45 

1 11 21 31 41 51 
I I I I I I 

MLLRSAGKJLN VGTKKEDGES TAFTPRPKVL RCKCHHHCPE DS VNNICSTD GYCFIMIEED 60 
„ DSGLPVVTSG CLGLEGSDFQ CRDTP1PHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 
50 GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETY1 PPGESLRDL1 180 

EQSQSSGSGS GLPLLVQRT1 AKQ1QMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG F1AADIKGTG SWTQLYLHD YHENGSLYDY LKSTTLDAKS 300 
MUCLAYSSVS GLCHLHTE1F STQGKPA1AH RDLKSKNILV KKNGTCCIAD LGLAVKHSD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 
55 EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 



SEQ ID NO: 184 CBF9 DNA sequence 
Nucleic Acid Accession #: 
Coding Sequence: 



51 (undedined sequences correspond to start and stop codons) 




CCCCCTGGCC CGAGCCGCGC C 
TCGCCGCTCT CCTTCCGTTA TATCAACATG C 
TCCTGT TTTCCAGAGT G 

G GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTCGGC TGCAGTGGAC 
ATCATGTTTC TGTTAGATGG GTCTAACAGC GTCGGGAAAG GGAGCTTTGA AAGGTCCAAG 
CACTTTGCCA TCACAGTCTG TGACGGTCTG GACATCAGCC CCGAGAGGGT CAGAGTGGGA 
GCATTCCAGT TCAGTTCCAC TCCTCATCTG GAATTCCCCT TGGATTCATT TTCAACCCAA 
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CAGGAAGTGA AGGCAAGAAT CAAGAGGATG GTTTTCAAAG GAGGGCGCAC GGAGACGGAA 720 

CTTGCTCTGA AATACCTTCT GCACAGAGGG TTGCCTGGAG GCAGAAATGC TTCTGTGCCC 780 

CAGATCCTCA TCATCGTCAC TGATGGGAAG TCCCAGGGGG ATGTGGCACT GCCATCCAAG 840 

CAGCTGAAGG AAAGGGGTGT CACTGTGTTT GCTGTGGGGG TCAGGTTTCC CAGGTGGGAG 900 

GAGCTGCATG CACTGGCCAG CGAGCCTAGA GGGCAGCACG TGCTCTTGGC TGAGCAGGTG 960 

GAGGATGCCA CCAACGGCCT CTTCAGCACC CTCAGCAGCT CGGCCATCTG CTCCAGCGCC 1020 

ACGCCAGACT GCAGGGTCGA GGCTCACCCC TGTGAGCACA GGACGC TGGA GATGGTCCGG 1080 

GAGTTCGCTG GCAATGCCCC ATGCTGGAGA GGATCGCGGC GGACCCTTGC GGTGCTGGCT 1140 

GCACACTGTC CCTTCTACAG CTGGAAGAGA GTGTTCCTAA CCCACCCTGC CACCIGCTAC 1200 

AGGACCACCT GCCCAGGCCC CTGTGACTCG CAGCCCTGCC AGAATGGAGG CACAIGTGTT 1260 

CCAGAAGGAC TGGACGGCTA CCAGTGCCTC TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC 1320 

TGTGCCCTGA AGCTGAGCCT GGAATGCAGG GTCGACCTCC TCTTCCTGCT GGACAGCTCT 1380 

GCGGGCACCA CTCTGGACGG CTTCCTGCGG GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG 1440 

GCCGTGCTGA GCGAGGACTC TCGGGCCCGA GTGGGTGTGG CCACATACAG CAGGGAGCTG 1500 

CTGGTGGCGG TGCCTGTGGG GGAGTACCAG GATGTGCCTG ACCTGGTCTG GAGCCTCGAT 1560 

GGCATTCCCT TCCGTGGTGG CCCCACCCTG ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG 1620 

CGTGGCTTCG GGAGCGCCAC CAGGACAGGC CAGGACCGGC CACGTAGAGT GGTGGTTTTG 1680 

CTCACTGAGT CACACTCCGA GGATGAGGTT GCGGGCCCAG CGCGTCACGC AAGGGCGCGA 1740 

GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG GCCGTGCGGG CAGAGCTGGA GGAGATCACA 1800 

GGCAGCCCAA AGCATGTGAT GGTCTACTCG GATCCTCAGG ATCTGTTCAA CCAAATCCCT 1860 

GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG CGGCCAGGGT GCCGGACACA AGCCCTGGAC 1920 

CTCGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCCG AGAATTTTGC TCAGATGCAG 1980 

AGCTTTCTGA GAAGCTGTGC CCTCCAGTTT GAGGTGAACC CTGACGTGAC ACAGGTCGGC 2040 

CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTCGGGC TGGACACCAA ACCCACCCGG 2100 

GCTGCGATGC TGCGGGCCAT TAGCCAGGCC CCCTACCTAG GTOGGGTGGG CTCAGCCGGC 2160 

ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACCGTCC AGAGGGGTGC CCGGCCTGGT 2220 

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GGGAGAGGCG CAGAGGATGC AGCCGTTCCT 2280 

GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTSGTCG TGGGCGTGGG GCCTGTCCTA 2340 

AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC CGGGATTCCC TGATCCACGT GGCAGCTTAC 2400 

GCCGACCTGC GGTACCACCA GGACGTGCTC ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG 2460 

CCAGTCAACC TCTGCAAACC CAGCCCGTGC ATGAATGAGG GCAGCTGCGT CCTGCAGAAT 2520 

GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGCC CCCACTGCGA GAACCGTGAG 2580 

TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGACGCC CCTGAGGCAC 2640 

ATCGCTCCCG TGCAGGAGGG CAGCAGCCGT ACCCCTCCCA GCAACTACAG AGAAGGCCTG 2700 

GGCACTGAAA TGGTGCCTAC CTTCTGGAAT GTCTGTGCCC CAGGTCC TTA G AATGTCTGC 2760 

TTCCCGCCGT GGCCAGGACC ACTATTCTCA CTGAGGGAGG AGGATGTCCC AACTGCAGCC 2820 

ATGCTGCTTA GAGACAAGAA AGCAGCTGAT GTCACCCACA AACGATGTTG TTGAAAAGTT 2880 

TTGATGTGTA AGTAAATACC CACTTTCTGT ACCTGCTGTG CCTTGTTGAG GCTATGTCAT 2940 

CTGCCACCTT TCCCTTGAGG ATAAACAAGG GGTCCTGAAG ACTTAAATTT AGCGGCCTGA 3000 

CGTTCCTTTG CACACAATCA ATGCTCGCCA GAATGTTGTT GACACAGTAA TGCCCAGCAG 3060 

AGGCCTTTAC TAGAGCATCC TTTGGACGGC GAAGGCCACG GCCTTTCAAG ATGGAAAGCA 3120 

GCAGCTTTTC CACTTCCCCA GAGACATTCT GGATGCATTT GCATTGAGTC TGAAAGGGGG 3180 

CTTGAGGGAC GTTTGTGACT TCTTGGCGAC TGCCTTTTGT GTGTGGAAGA GACTTGGAAA 3240 

GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTGGTTGA TGATGGGGGA GGGGCTGAGT 3300 

TGTGCATGGG CCCAGGTCTG GAGGGCCACG TAAAATCGTT CTGAGTCGTG AGCAGTGTCC 3360 
ACCTTGAAGG TCTTC 



I I 



SVGKGSFERS KHFA1TVCDG LDISPEKVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDA1WGLFS TLSSSAICSS ATPDCRVEAH 
PCEHRTLEM7 REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTItDGFL 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 
LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RBLLLLGVGS 
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLETKPT RAAMLRAISQ 
APYLGGVGSA GTALI.HIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 
SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 
RTPPSNYREG LGTEMVPTFW NVCAPGP 



70 SEQIDNO:1B6PAV1Df 

Nucleic Acid Accession* AF272890 1 

Coding Sequence: 87-1 520 (undedined sequences correspond lo start and stop codons) 



TGCTACCCGC GCCCGGGCTT CTGGGGTGTT CCCCAACCAC GGCCCAGCCC TGCCACACCC 
CCCGCCCCCG GCCTCCGCAG CTCGGCATGG GCGCGGGGGT GCTCGTCCTG GGCGCCTCCG 
AGCCCGGTAA CCTGTCGTCG GCCGCACCGC TCCCCGACGG CGCGGCCACC GCGGCGCGGC 



?EQ ID NQ:185CPF9 Protein sequence 
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TGCTGGTGCC CGCGTCGCCG CCCGCCTCGT T 
CGCTGTCTCA GCAGTGGACA GCGGGCATGG GTCTGCTGAT GGCGCTCATC GTGCTGCTCA 
TCGTGGCGGG CAATGTGCTG GTGATCGTGG CCATCGCCAA GACGCCGCGG CTGCAGACGC 
TCACCAACCT CTTCATCATG TCCCTGGCCA GCGCCGACCT GGTCATGGGG CTGCTGGTGG 
TGCCGTTCGG GGCCACCATC GTGGTGTGGG GCCGCTGGGA GTACGGCTCC T 
AGCTGTGGAC C 

TTGCCCTGGA CCGCTACCTC GCCATCACCT CGCCCTTCCG CTACCAGAGC CTGCTGACGC 



: GTCACCAACC 



AGMGCAGGT 



CTCGCCCTCG 



CCCGTCCCCG 
GCCCCGCTGG 
GAGCAGAAGG 
CCCTTCTTCC 
TTCGTCTTCT 
CGCAGCCCCG 
CGCCGGCGCC 
GGACCCCCGC 
CGTCGGGGCC ACGCCGCCCG C 



CCGCCGCCGC 
CGCGCCTCGT 
TCTTCACGCT 
AGCTGGTGCC 
TCAACCCCAT 
GCTGCGCGCG 
CGGGCTGTCT 
ACGACGATGT 
ACGGCGGGGC GGCGGCGGAC AGCGACTCGA G 
CCTCGGAATC C 



GGCCCTACGC 
CTCCTGGCTG 
CGACCGCCTC 
CATCTACTGC 
CAGGGCTGCC 



GGGCCTACGC CATCGCCTCG TCCGTAGTCT 
TCGTGTACCT GCGGGTGTTC CGCGAGGCCC 
AGCGCCGTTT CCTCGGCGGC CCAGCGCGGC 
CGCCCGCGCC GCCGCCCGGA CCCCCGCGCC 
CCAACGGGCG TGCGGGTAAG CGGCGGCCCT 
CGCTCAAGAC GCTGGGCATC ATCATGGGCG 
TGGCCAACGT GGTGAAGGCC TTCCACCGCG 
TCAACTGGCT GGGCTACGCC AACTCGGCCT 
ACTTCCGCAA GGCCTTCCAG GGACTGCTCT 
ACGCGACCCA CGGAGACCGG CCGCGCGCCT 



TTTGGGAAGG 



AGATCTGTGT TTACTTAAGA CCGATAGCAG GTGAACTCGA AGCCCACAAT 
ATCATCCGAG GCAAAGAGAA AAGCCACGGA CCGTTGCACA AAAAGGAAAG 
GATGGGAGAG TGGCTTGCTG ATGTTCCTTG TTG 



I 



I 



21 



I 



I 



CVTASIETLC 



P EPLSQQWTAG 
L ASADLVMGLL WPFGATIW 
I TSPFRYQSLL TRARARGLVC 
V VSFYVPLCIM 

E AQKQVKKIDS CERRFLGGPA RPPSPSPSFV PAPAPPPGPP RPAAAAATAP 
LANGRAGKRR PSRLVALREQ KALKTLGIIM GVFTLCWLPF FLANWKAFH RELVPDRLFV 
FFNWLGYANS AFNPIIYCRS PDFRKAFQGL LCCARRAARR RHATHGDRPR ASGCLARPGP 
PPSPGAASDD DDDDWGATP PARLLEPWAG CNGGAAADSD SSLDEPCRPG FASESKV 



SEQlDNO:1B3BC02DNAs & 



81-3080 (underiined sequences correspond to start and stop oodons) 



1 11 21 31 41 51 
I I I I I I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAG A TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGUAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATGAATAAG G ATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAU 1 GO )AC C C i GGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
GTCACTCTGG CATTCACCTC TCTTCAG ATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1 800 
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TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGOGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 
GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGG A AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 
TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGG AACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATG ATGAGG ACTACCAGGA ACTCATTGAA G ACATAGTTC 2880 
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 
TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATCA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 
GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600 
ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

gEQ ID NQ:189 BgQg PrQjein sequence 

Protein Accession ft: CAB92285 



1 11 21 31 41 51 
I I I I I I 

MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNffGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180 
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTS V VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 
STGVHCSCPV GFTLQLDGKT CKD1DECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGML 480 
SSDVTTKTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMHTVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKA1R TLRKAVHREQ 600 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS G ALKTPEA WN MSECGGLCQP GEYS ADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRC1RC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN 1TQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRIL1V VPEIFLP1ED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 
IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSF1RLLRSK VSRFLRPYK 

S5Q ID NQ:1S0 BFfjl pNA Sfi SMfiB SS 

Nucleic Acid Accession*: AF007170 

Coding sequence: 1-1 72S (underlined sequences correspond to stop codon) 

1 11 21 31 41 51 
I I I I I I 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 
CTGGAGATGC AGGCCATGAT GACCTTTG AC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 
AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 
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GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1 140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 
TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 
CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGIA£CTTTG TGCAGCAGTT 1740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTC AGCTTGT GTG ACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAA AAT GCCAGGGCTT GATGGA AG AG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 

SEQ IDH0:191BFfl1 Protein sequence 



1 11 21 31 41 51 
I I I I I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSPSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS 120 
FIKGG1KVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVND3EAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATY1YM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420 
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 



gEfjipN?:1?2PFQgpNACTenge 

Nucleic Acid Accession*: NM.032683 

Coding sequence: 1-4044 (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 
I 1 I I I 1 

ATGACTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC 60 
ATCGACATAG GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 120 
GGCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG 180 
TGGGGGAAGT ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT 240 
CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 300 
ACCCCGCTCA TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG 360 
TCAGTCCATG ATGCCTCAG A CAAAA ATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420 
GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA 480 
ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG 540 
CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600 
CATGGAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC 660 
TCCTCCAGTT GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC 720 
TTTGCCTTTG AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC 780 
ATCAGCTTCT TCACCGGTGA TGTA AACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA 840 
GTACTGATCA CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA 900 
TACACTGCAT TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG 960 
ACAAGAATGG CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT 1020 
GTGACCAGTG AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA 1080 
TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1 140 
ATGGCCTTCA GCATGCTGGC CTCCTTGAAT CTCCTTCGGC TGTCAGTGTT CTTTGTGCCT 1200 
ATTGCAGTCA A AG GTCTCAC GAATTCCAAG TCTGCAGTGA TGAGGTTCAA GAAG11111C 1260 
CTCCAGGAGA GCCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGGTC 1320 
TTTGAGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 1380 
GAGCTGG AGA GGAACGGGCA TGCTTCTGAG GGGATGACCA GGCCTAGAGA TGCCCTCGGG 1440 
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CCAGAGGAAG AAGGGAACAG CCTGGGCCCA GAGTTGCACA AGATCAACCT GGTGGTGTCC 1500 
AAGGGGATGA TGTTAGGGGT CTGCGGCAAC ACGGGGAGTG GTAAGAGCAG CCTGTTGTCA 1560 
GCCATCCTGG AGGAGATGCA CTTGCTCGAG GGCTCGGTGG GGGTGCAGGG AAGCCTGGCC 1620 
TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATCA GGGAGAACAT CCTCATGGGA 1 680 
5 GGCGCATATG ACAAGGCCCG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATCGGGAC 1740 
CTGGAACTTC TGCCCTTTGG AG ACATGACA GAGATTGGAG AGCGGGGCCT CAACCTCTCT 1 800 
GGGGGGCAGA AACAGAGGAT CAGCCTGGCC CGCGCCGTCT ATTCCGACCG TCAGATCTAC 1860 
CTGCTGGACG ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAG 1920 
TGCATTAAGA AGACACTCAG GGGGAAGACG GTCGTCCTGG TGACCCACCA GCTGCAGTAC 1980 

10 TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGGA AAATCTGTGA AAATGGAACT 2040 
CACAGTGAGT TAATGCAGAA AAAGGGGAAA TATGCCCAAC TTATCCAGAA GATGCACAAG 2100 
GAAGCCACTT CCGACATGTT GCAGGACACA GCAAAGATAG CAGAGAAGCC AAAGGTAGAA 2160 
AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAACG GAAATGCTGT GCCGGAGCAT 2220 
CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 2280 

15 TACATCCAGG CAGCTGGAGG TTACATGGTC TCTTGCATAA TTTTCTTCTT CGTGGTGCTG 2340 
ATCGTCTTCT TAACGATCTT CAGCTTCTGG TGGCTGAGCT ACTGGTTGGA GCAGGGCTCG 2400 
GGGACCAATA GCAGCCGAGA G AGCAATGGA ACCATGGCAG ACCTGGGCAA CATTGCAGAC 2460 
AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TACGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

„ GTGGGGGTCT GCTCCTCAGG GATTTTCACC AAAGTCACGA GGAAGGCATC CACGGCCCTG 2580 

20 CACAACAAGC TCTTCAACAA GGTTTTCCGC TGCCCC ATGA GTrTCTTTGA C ACCATCCCA 2640 
ATAGGCCGGC TTTTGAACTG CTTCGCAGGG GACTTGGAAC AGCTGGACCA GCTCTTGCCC 2700 
ATCTTTTCAG AGCAGTTCCT GGTCCTGTCC TTAATGGTGA TCGCCGTCCT GTTGATTGTC 2760 
AGTGTGCTGT CTCCATATAT CCTGTTAATG GGAGCCATAA TCATGGTTAT TTGCTTCATT 2820 

_ _ TATTATATGA TGTTCAAGAA GGCCATCGGT GTGTTCAAGA GACTGGAGAA CTATAGCCGG 2880 

25 TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATGTCTAT 2940 

GGAAAAACTG AAGACTTCAT CAGCCAGTTT AAGAGGCTGA CTGATGCGCA GAATAACTAC 3000 
CTGCTGTTGT TTCTATCTTC CACACGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 
CTTGTGACCT TGGCTGTTGC CCTGTTCGTG GCTTTTGGCA TTTCCTCCAC CCCCTACTCC 3120 
TTTAAAGTCA TGGCTGTCAA CATCGTGCTG CAGCTGGCGT CCAGCTTCCA GGCCACTGCC 3180 

30 CGGATTGGCT TGGAGACAGA GGCACAGTTC ACGGCTGTAG AGAGGATACT GCAGTACATG 3240 
AAGATGTGTG TCTCGGAAGC TCCTTTACAC ATGGAAGGCA CAAGTTGTCC CCAGGGGTGG 3300 
CCACAGCATG GGGAAATCAT ATTTCAGGAT TATCACATGA AATACAGAGA CAACACACCC 3360 
ACCGTGCTTC ACGGCATCAA CCTGACCATC CGCGGCCACG AAGTGGTGGG CATCGTGGGA 3420 
AGG ACGGGCT CTGGGAAGTC CTCCTTGGGC ATGGCTCTCT TCCGCCTGGT GG AGCCCATG 3480 

35 GCAGGCCGGA TTCTCATTGA CGGCGTGGAC ATTTGCAGCA TCGGCCTGGA GGACTTGCGG 3540 
TCCAAGCTCT CAGTGATCCC TCAAGATCCA GTGCTGCTCT CAGGAACCAT CAGATTCAAC 3600 
CTAGATCCCT TTGACCGTCA CACTGACCAG CAGATCTGGO ATGCCTTGGA GAGGACATTC 3660 
CTGACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA CAGATGTGGT GGAAAACGGT 3720 
GGAAACTTCT CTGTGGGGGA GAGGCAGCTG CTCTGCATTG CCAGGGCTGT GCTTCGCAAC 3780 

40 TCCAAGATCA TCCTTATCGA TGAAGCCACA GCCTCCATTG ACATGGAGAC AGACACCCTG 3840 
ATCCAGCGCA CAATCCGTGA AGCCTTCCAG GGCTGCACCG TGCTCGTCAT TGCCCACCGT 3900 
GTCACCACTG TGCTGAACTG TGACCACATC CTGGTTATGG GCAATGGGAA GGTGGTAGAA 3960 
TTTGATCGGC CGGAGGTACT GCGGAAGAAG CCTGGGTCAT TGTTCGCAGC CCTCATGGCC 4020 

, _ ACAGCCACTT CTTCACTGAG ATAAGGAGAT GTGGAGACTT CATGGAGGCT GGCAGCTGAG 4080 

45 CTCAGAGGTT CACACAGGTG CAGCTTCGAG GCCCACAGTC TGCGACCTTC TTGTTTGGAG 4140 
ATGAGAACTT CTCCTGGAAG CAGGGGTAAA TGTAGGGGGG GTGGGGATTG CTGGATGGAA 4200 
ACCCTGGAAT AGGCTACTTG ATGGCTCTCA AGACCTTAGA ACCCCAGAAC CATCTAAGAC 4260 
ATGGGATTCA GTGATCATGT GGTTCTCCTT TTAACTTACA TGCTGAATAA TTTTATAATA 4320 
AGGTAAAAGC TTATAGTTTT CTGATCTGTG TTAGAAGTGY TGCAAATGCT GTACTGACTT 4380 

50 TGTAAAATAT AAAACTAAGG AAAACTCAAA AAAAAAAAAA AAAAAAA 

SEP ID N0:193 EF06 Protein sequence 

Prolein Accession*: NP_1 15972.1 

55 1 11 21 31 41 51 
I I I I I I 

MTRKRTYWVP NSSGGLVNRG IDIGDDMVSG OYKTYTLQD GPWSQQERNP EAPGRAAVPP 60 
WGKYDAALRT MIPFRPKPRF PAPQPLDNAG LFS YLTVS WL TPLMIQSLRS RLDENTIPPL 120 
, n SVHDASDKNV QRLHRLWEEE VSRRGIEKAS VLLVMLRFQR TRUFDALLG ICFCIASVLG 180 
60 PELIIPKILE YSEEQLGNW HGVGLCFALF LSECVKSLSFSSSWI1NQRT A1RFRAAVSS .240 
FAFEKLIQFK SVIHITSGEA ISFFTGDVNY LFEGVCYGPL VL1TCASLV1 CSISSYFIIG 300 
YTAFIAILCY LLVFPLAVFM TRMAVKAQHH TSEVSDQRK VTSEVLTCIK LIKMYTWEKP 360 
FAK1IEGMES LTFCSKFGDG MAFSMLASLN LLRLSVFFVP IAVKGLTNSK S AVMRFKKFF 420 
LQESPVFYVQ TLQDPSKALV FEEATLSWQQ TCPGWNGAL ELERNGHASE GMTRPRDALG 480 
65 PEEEGNSLGP ELHKINLVVS KGMMLGVCGN TGSGKSSLLS AILEEMHLLE GSVGVQGSLA 540 
YVPQQAWIVS GNIRENILMG GAYDKARYLQ VLHCCSLNRD LEIXPFGDMT EIGERGLNLS 600 
GGQKQR1SLA RAVYSDRQIY LLDDPLSAVD AHVGKHIFEE CIKKTLRGKT WLVTHQLQY 660 
LEFCGQIILL ENGKICENGT HSELMQKKGK YAQLIQKMHK EATSDMLQDT AKIAEKPKVE 720 
SQALATSLEE SLNGNAVPEH QLTQEEEMEE GSLSWRVYHH YIQAAGGYMV SCIIFFFVVL 780 
70 IVFLTIFSFW WLSYWLEQGS GTNSSRESNG TMADLGN1AD NPQLSFYQLV YGLNALLLIC 840 
VGVCSSGIFT KVTRKASTAL HNKLFNKVFR CPMSFFDTD? IGRLLNCFAG DLEQLDQLLP 900 
IFSEQFLVLS LMVIAVLLIV SVLSPYILLM GAHMVICH YYMMFKKAIG VFKRLENYSR 960 
SPLFSHILNS LQGLSSIHVY GKTEDFISQF KRLTDAQNNY UXFLSSTRW MALRLEIMTN 1020 
_ LVTLAVALFV AFGISSTPYS FKVMAVNIVL QLASSFQATA RIGLETEAQF TAVERILQYM 1080 

75 KMCVSEAPLH MEGTSCPQG W PQHGEIIFQD YHMKYRDNTP TVLHG1NLTI RGHEVVGIVG 1140 
RTGSGKSSLG MALFRLVEPM AGRILIDGVD ICSIGLEDLR SKLSVIPQDP VLLSGT1RFN 1200 
LDFFDRHTDQ QIWDALERTF LTKAISKFPK KLHTDVVENG GNFSVGERQL LCIARAVLRN 1260 
SKHLIDEAT AS IDMETDTL IQRTIREAFQ GCTVLVIAHR VTTVLNCDHI LVMGNGKVVE 1320 
FDRPEVLRKK PGSLFAALMA TATSSLR 
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1-1749 (underlined sequences correspond to start and stop codons) 



ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGAC TGCGG CTGGGGGCGC GCAGGACCCT 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 
TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC O 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT C 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA C 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTG CAGCGCCGAG 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC 




G ACCAACTGTT 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC ' 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 
AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 
A GGGAGACGAA GGCAGGGGTG 



GATTCTCGAG 6TGGAAGTCC GCACATGTCG GTGGTATTTA 1 
GCTTCATTTG CCCCCAGACT GTATGAAAAC A 
CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC A 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA T 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 

TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 

TTTTTTTTTT TTTOTTTTAA TCAGACAGGG TCTIGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 

TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTCTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 

GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 

TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC C 
CAAATTCAAG TGAATTTATT T 

3480 
3540 

TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3 6 GO 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
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SEQ ID NO:195 BHBB Protein sequence 

Protein Accession*: none found 

5 

1 11 21 31 41 51 

I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PREPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 
10 SGRQPRGPSD CIPRPPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS DEARGS PGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 240 

CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 3 00 

ARLDQGGCOT DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
_ SPKGRQLLRA FPGLLERARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 
15 LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLEEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 

HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

SEQ ID NO:196 CQA5 DNA SEQUENCE 

20 Nucleic Acid Accession* AA088458 

Coding sequence: 862-1 995 (undefined sequences correspond to start and stop codons) 



GCCCTTGQAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAGGACA CTGACATGGA 60 

CTGAAGAAAA AGGAGCTGGA GCAGGAGAAG GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GCGCGGGGCC GCGACTGGTA CCAGCAGCAG CTGCAACGAG TGCAGGAGCG CCAGCGCCGC 180 

CTGGGCCAGA GCAGAGCCAG CGCCGACTTT GGGGCTGCAG GGAGCCCCCG CCCACTGGGG 240 

CGGCTACTGC CCAAGGTACA AGAGGTGGCC CGGTGCCTGG GGGAGCTGCT GGCTGCAGCC 300 

ACCTCACCCC CGGTCTGGCA G 
CTCACCCAGG AGGTGACCGA G 

GCGCTCATTA AGCAGCTGTT TGAGGCCCGC GCCCTGAGCC AGCAGGACGG GGGACCTCTG 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC GCGTGGGCCC CCAGGGCCAG CCTGGCACTC 600 

AGCCCTTCGA GGGTGGGCGC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

GCCCAGGCAC AGTCCCGGAG TGGGCGCCTT CCTGCCGCCC TTGCCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GGCTGGTCCC CGCACCGAGC GCTTGACTCC GTTTKGGCTC CTGGTTGYTG 780 

ACATGGGCTG GGGGCTCTCT TGAGTCCGCA TAGTCCGCAG CTACTACTGG CCGCTGTCAG 840 

TGGACAGTGG GGTACCCCTC CATGAGTTAG CGTCCCCCCG TTTCCAGCGG TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAG GCACTGCCCA CGCCAGGCTG CAGTTCCAAC AACGGGCAGC 960 

AGAGGGCGCG GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGCTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTG GGGACGGCCT 1080 

GTAAGCGGGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGGA TAGCGGTCGG ACTTCAGGTT 1140 

CTGGCCAAGG CTGAGGGACC CTGGCTGCAG CGGATCGGCA CGCCGGGTGG GCGAGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCCGGGACG 1260 

TTGCCCCACG TTGAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAGC CTGTTTCCCC CGACTCAGGA TTTCCAAGGC 1380 

CTGGGGTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAGCTGG 1440 

GAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAGAACCCCC AGGGTACAGG AGGAGGCTGG 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGG GGCCCTGCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGG CACGGCAACA GCATCGATGG GTTCTGCAGC CCAGGGCCCC CGATGCGGGG 1680 

TCAGTGTGTG TGGGGCGCAG GGCCTCCGAT GCGGGGTCAG TGCGTGGGGG GCGCAGGGCC 1740 

CCCGATGCGG GGTCAGTGCG TGGGGGGCGC AGGGCCCCCT CGTGTCCAGG GCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCCGGAG CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCCACAGAG CCACATTCCC 1920 

TGCTGCACOT GGTCTGCAGG GGTGTCCCAG GACAGGCCCA AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTGGGCT TTCCAGGGGA CATAAGGATG TCAGGCCTGG 2040 

ACCTCCTGGG CAGGAAAGGG TGCAGGTCCT GAGGGCCTGT GCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTG GGTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCCGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA CGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCCGG 22 80 

GGTGACTTCA TCAGGAGACC GCCCACATAG AGCTGGACCC CGCAGCTGAA GCGGAAATGT 2340 

GAGACAGGCT GGCACCTCCG GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400 

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGGAGAC 2460 

CACGAGGGGA GAATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTGGCTGGC AGAGGCACAT 2520 

GCAGACCCTG CCTGGAGCCT GCCCTAGGAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2580 

GAGOAGCGTC CCTGGGCTCT ATCCGCGAGG TCCCAGTAGC GTGTGCAGGT ACATACACGT 2640 

GCGTGCACAC TGTGATGACA CCCGGAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

CAGAAGTGTC CCCAGTTGAG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAGGAAAAGG AACATCTCAG CCGGGCGTGG TGGTTCACGC 2820 

CTGGAATCCC AGCACTTGAG GCCAGGAGTT CCAGAGCAGC CTGGGCAACG CAGTGAGAGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAGACCCAGA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AGAAATAAAA GAGATTTCTG GAAACATGAA AAAAAA 
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SEQ ID NO;197 LBG2 DNA SEQUENCE 

Nucleic Acid Accession* X63629 

Coding sequence; 54-2543 (start and slop codons are underlined) 

5 1 11 21 31 41 51 

I I I I I I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATG.GGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTC CAGTGCGCGG 110 

„ . CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 

10 CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 
AAAGAAGGTC ACTG AAGGAA AGG AATCCAT TGAAGATCTT CCCATCCA AA CGTATCTTAC 3 60 
GAAG ACACAA G AGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

15 ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

20 ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 

25 GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1 140 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 

nn CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TITGTCCCAC 1380 

30 CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGXCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

35 ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

40 ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTCTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGCC GG AGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 

45 TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TICGACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACCKjTGGCGGGGAGGACGACTAG^CGGCCTGCCTXjCAGGGCTGGGGACCAAACGTCAGGC 2580 

50 CACAG AGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
GGAAGTGGCC GTAGCAACTT GGCGG AG ACA GGCTATG AGT CTG ACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGOGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

55 CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAA AGTT CTTC AAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTC CCATTC GGATGGATCT CTGCGTTnT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 

60 

SEQ ID NO:198 LBG2 Protein seouence: 

Protein Accession (f: CAA45177 

65 

1 11 21 31 41 51 

MGLPRGPLAS LlLqVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NFLKIFPSKR ILRRHKRDWV VAPIS VPENG 120 

70 KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NG AS VEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTS VMQVTAT 240 
DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAWEDLD ANDNAFMFDP QKYEAHVPEN AVGHEVQRLT VTOLDAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

75 PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPFTTG TGTLLLTUD 540 
VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RK1KEPLLLP EDDTRDN VFY YGEEGGGEED QDYDITQLHR 720 
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SEQ ID NO:199 OBI5 DNA SEQUENCE 

NMJJ12152 

43-1104 (underlined sequences correspond to start and stop codoi 



CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 
GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 
GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 
TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 
TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 
ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 
GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 
AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 
CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 
TGGAATTGCC T 



TACCTTGTTT TCTGGAC 
CTGCGGATCT ACGTGTA 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA C 
GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 
AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 
GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 
ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 
GTCCICAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 
GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 

SEQ ID NO:200Qrjl5 P^ln SEffltSDSS 



1 11 21 31 41 51 

I I I I I I 

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL V 
FHFPFYYLIA MLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 
LVIAVERHMS IMRMRVHSNIj TKKRVTLLIL LVWAIAIFMG AVPTLGWNCIi CNISACSSLA 
PIYSRSYLVF WTVSNLMAFL IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 
VMTVLGAFW CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNKSTS 

SEQ ID NO:201 PAA6 DNA SEQUENCE 



1 11 21 31 41 51 

I I I I I I 

ATGACCTACA GTTACTCATT TTTCAGGCCT GAGTTGATCG TTAATCATCT TAATTATGTT 
CATTCTGAAG CCAACAGGAG AACCAAGACC AAAACTTTAT TGTCTCTGCT TTCATTTCTT 
GATGAAACCT CTGGACTAAG CACACATCTT CCTTGTTTAT CTCTCTCAAA GGAGTGTGGA 
GTGCTTCATC TGGACATCCA CGGGAAGAAG GAAGACATGA GAATCACCCA ACAGTCTTCC 
CAGCTATACC TGTGGGACAT GGCTGGTTTT ACAATATTTA AGAACCTGTG GATGAGCCTC 
ATACCCAGAG GGAACAAACG CTCCCCAAAA AGAGTTACAG AAACCATCCT GAGAGATTTT 
AAGCAGAAGC AAAGTTCAAA GATCCAAGAG GAGAGACGAA GAGAGTCTGC AGGACCAAAC 
CTCTCTTCAT TCTGGTTTGT GGGGAATGCT GGAAGAGGAG ACAGGCCCCA GATTTGGGCA 
GGAAGTAAAC AGTTTTCAGG CTGAGGCCAA TCTGAGCAGG AACATTCCAA TATTTCTTCA 
GCTACGTTGT CCCAGCACTT CACTGGTTAA CCTTTTATGT CCACCATTTG TGGATTTCAC 
AGCTACTTGT CAATGGTGAA TATTGATCAT CATCATTATC TACTGAGCTG CTACCATATS 
CCAGCTACTC CTTGCATGTT GTTCATTATT TTCTCAACAC TCAGCATATT TGCAATATGT 
TATGTAATAT CACAGACAAG GAAACTGAAC GCAGAAATGT TTTATTTCTT GCCAAACATC 
ACATGAGGAT GAACAATGAA ACCGATTTGA AACCAGGATT GTCTGATTCC AACATCTCTG 
GGTCCTTTTT CACTCTGATA TGCTGCAATT AAAAAGCCAT TTCTAAGACT GT 



MTYSYSFFRP ELIVNHLNYV HSEANRRTKT KTLLSLLSFI, DETSGLSTHL PCLSLSKECG 
VLHLDIHGKK EDMRITQQSS QLYLWDMGGF TIFKNLWKSL IPRGNKRSPK RVTETILRDF 
KQKQSSK1QE ERRRESAGPN LSSFWFVGNA GRGDRPQIWA GSKQFSG 
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SEQ ID NO:203 PAB2 DNA SEQUENCE 

Nucleic Acid Accession*: XM.050197 

Coding sequence: 31 0-1 971 (underlined sequences correspond to start and stop codons) 

1 ■ 11 21 31 41 SI 

1 I I I I I 

TCACACGTGC CAAGGGGCTG GCTCAGCGGA ACCAGCCTCC ACGCGCTGGC TCCGGGTGAC 

AGCCGCGCGC CTCGGCCAGG ATCTGAGTGA TGAGACGTGT CCCCACTGAG GTGCCCCACA 
GCAGCAGGTG TTGAGCATGG GCTGAGAAGC TGGACCGGCA CCAAAGGGCT GGCAGAAATG 

GGCGCCTGGC TGATTCCTAG GCAGTTGGCG GCAGCAAGGA GGAGAGGCCG CAGCTTCTGG 

AGCAGAGCCG AGACGAAGCA GTTCTGGAGT GCCTGAACGG CCCCCTGAGC CCTACCCGCC 

TGGCCCAC TA TGG TCCAGAG GCTGTGGGTG AGCCGCCTGC TGCGGCACCG GAAAGCCCAG 

CTCTTGCTGG TCAACCTGCT AACCTTTGGC CTGGAGGTGT GTTTGGCCGC AGGCATCACC 

TATGTGCCGC CTCTGCTGCT GGAAGTGGGG GTAGAGGAGA AGTTCATGAC CATGGTGCTG 

GGCATTGGTC CAGTGCTGGG CCTGGTCTGT GTCCCGCTCC TAGGCTCAGC CAGTGACCAC I 

TGGCGTGGAC GCTATGGCCG CCGCCGGCCC TTCATCTGGG CACTGTCCTT GGGCATCCTG i 

CTGAGCCTCT TTCTCATCCC AAGGGCCGGC TGGCTAGCAG GGCTGCTGTG CCCGGATCCC 

AGGCCCCTGG AGCTGGCACT GCTCATCCTG GGCGTGGGGC TGCTGGACTT CTGTGGCCAG 

GTGTGCTTCA CTCCACTGGA GGCCCTGCTC TCTGACCTCT TCCGGGACCC GGACCACTGT 



CTGCCTGCCA TTGACTGGGA CACCAGTGCC CTGGCCCCCT ACCTGGGCAC CCAGGAGGAG 
TGCCTCTTTG GCCTGCTCAC CCTCATCTTC CTCACCTGCG TAGCAGCCAC ACTGCTGGTG 
GCTGAGGAGG CAGCGCTGGG CCCCACCGAG CCAGCAGAAG GGCTGTCGGC CCCCTCCTTG 
TCGCCCCACT GCTGTCCATG CCGGGCCCGC TTGGCTTTCC GGAACCTGGG CGCCCTGCTT 

GAGCTGTGCA GCTGGATGGC ACTCATGACC TTCACGCTGT TTTACACGGA TTTCGTGGGC 
GAGGGGCTGT ACCAGGGCGT GCCCAGAGCT GAGCCGGGCA CCGAGGCCCG GAGACACTAT 
GATGAAGGCG TTCGGATGGG CAGCCTGGGG CTGTTCCTGC AGTGCGCCAT CTCCCTGGTC 
TTCTCTCTGG TCATGGACCG GCTGGTGCAG CGATTCGGCA CTCGAGCAGT CTATTTGGCC 
AGTGTGGCAG CTTTCCCTGT GGCTGCCGGT GCCACATGCC TGTCCCACAG TGTGGCCGTG 
GTGACAGCTT CAGCCGCCCT CACCGGGTTC ACCTTCTCAG CCCTGCAGAT CCTGCCCTAC 
ACACTGGCCT CCCTCTACCA CCGGGAGAAG CAGGTGTTCC TGCCCAAATA CCGAGGGGAC 
ACTGGAGGTG CTAGCAGTGA GGACAGCCTG ATGACCAGCT TCCTGCCAGG CCCTAAGCCT 
GGAGCTCCCT TCCCTAATGG ACACGTGGGT GCTGGAGGCA GTGGCCTGCT CCCACCTCCA 



630 



ACCGAGGCCA GGGTGGTTCC GGGCCGGGGC ATCTGCCTGG ACCTCGCCAT CCTGGATAGT 
GCCTTCCTGC TCTCCCAGGT GGCCCCATCC CTGTTTATGG GCTCCATTGT CCAGCTCAGC 
CAGTCTGTCA CTGCCTATAT GGTGTCTGCC GCAGGCCTGG GTCTGGTCGC CATTTACTTT 
GCTACACAGG TAGTA.TTTGA CAAGAGCGAC TTGGCCAAAT ACTCAGCG TA GA AAACTTCC 

AGCACATTGG GGTGGAGGGC CTGCCTCACT GGGTCCCAGC TCCCCGCTCC TGTTAGCCCC 2040 

ATGGGGCTGC CGGGCTGGCC GCCAGTTTCT GTTGCTGCCA AAGTAATGTG GCTCTCTGCT 2100 

GCCACCCTGT GCTGCTGAGG TGCGTAGCTG CACAGCTGGG GGCTGGGGCG TCCCTCTCCT 2160 

CTCTCCCCAG TCTCTAGGGC TGCCTGACTG GAGGCCTTCC AAGGGGGTTT CAGTCTGGAC 2220 

TTATACAGGG AGGCCAGAAG GGCTCCATGC ACTGGAATGC GGGGACTCTG CAGGTGGATT 2280 

ACCCAGGCTC AGGGTTAACA GCTAGCCTCC TAGTTGAGAC ACACCTAGAG AAGGGTTTTT 2340 

GGGAGCTGAA TAAACTCAGT CACCTGGTTT CCCATCTCTA AGCCCCTTAA CCTGCAGCTT 2400 

CGTTTAATGT AGCTCTTGCA TGGGAGTTTC TAGGATGAAA CACTCCTCCA TGGGATTTGA 2460 

ACATATGAAA GTTATTTGTA GGGGAAGAGT CCTGAGGGGC AACACACAAG AACCAGGTCC 2520 

CCTCAGCCCC ACAGGCACTG GTCTTTTTTG CTNGANTCCA CCCCCCCCCT CTTTACCCTT 2580 



1 11 21 31 41 51 

I I I I I I 

HVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PIALEVGVEE KFMTMVLGIG 60 
60 PVLGLVCVPL LGSASDHWRG RYGRRRPFIW ALSLGILLSL FLIPRAGWLA GLLCPDPRPL 120 

ELALLILGVG LU3FCGQVCF TPLEALLSDL FRDPDHCRQA YSVYAFMISL GGCLGYLLPA 180 
IDWDTSALAP YLGTQEECLF GLLTLIFLTC VAATLLVAEE AALGPTEPAE GLSAPSLSPH 240 
CCPCRARLAF RNLGALLPRL HQLCCRMPRT LRRLFVAELC SWMALMTFTL FYTDFVGEGL 300 
YQGVPRAEPG TEARRHYDEG VRMGSLGLFL QCAISLVFSL VMDRLVQRFG TRAVYLASVA 360 
65 AFPVAAGATC LSSSVAWTA SAALTGFTFS ALQILPYTEiA SLYHREKQVF LPKYRGDTGG 420 

ASSEDSLMTS FLPGPKPGAP F PNGHVGAGG SGLLPPFPAL CGASACDVSV RVWGEPTEA 480 
RWPGRGICL DLA1LDSAFL LSQVAPSLFM GSIVQLSQSV TAYMVSAAGL GLVAIYFATQ 540 
WFDKSDLAK YSA 

70 SEQ ID NO:205 PAJ3 DNA SEQUENCE 

Nucleic Acid Accession*: AKO02126 

Coding sequence: 1-1593 (underlined sequences correspond lo start and stop codons) 

75 1 11 21 31 41 

I I I I I I 

ATGG1TCGCC GGGGGCTGCT TGCGTGGATT TCCCGGGTGG TO3TTTTGCT GGTGCTCCTC 60 
A TCTCTGTCCT GTACATGTTG GCCTGCACCC CAAAAGGTGA CGAGGAGCAG 120 
;ttcag 180 



GAGTGGGAGG AGCAGCACCG CAACTACGTG AGCAGCCTGA AGCGGCAGAT 
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20 
25 

30 
35 
40 
45 
50 
55 
60 



G CTCAGGAATG GGCAGTACCA AGCCAGCGAT 
C TGGGTCTGGA CAGGAGCCCC CCAGAGAAAA CCCAGGCCGA CCTCCTGGCC 
TTCCTGCACT CGCAGGTGGA CAAGGCAGAG GTGAATGCTG GCGTCAAGCT GGCCACAGAG 
TATGCAGCAG TGCCTTTCGA TAGCTTTACT CTACAGAAGG TGTACCAGCT GGAGACTGGC 
CTTACCCGCC ACCCCGAGGA GAAGCCTGTG AGGAAGGACA AGCGGGATGA GTTGGTGGAA 
C AATCCTGCAG AGAACAGCCC CAATCACCGT 



TTGTATGAGC TCACCTTCAA AGGGGACCAC 

3AAAAGC TCAACATGGC CAACACGCTT 780 

3 GTGGACAAGT TCCGGCAGTT CATGCAGAAT 840 

3TCCATC TCACTGTTGT TTACTTTGGG 900 

AAAGAAGARA TAAATGAAGT CAAAGGAATA CTTGAAAACA CTTCCAAAGC TGCCAACTTC 960 

AGGAACTTTA CCTTCATCCA GCTGAATGGA GAATTTTCTC GGGGAAAGGG ACTTGATGTT 1020 

GGAGCCCGCT TCTGGAAGGG AAGCAACGTC CTTCTCTTTT TCTGTGATGT GGACATCTAC 1080 

TTCACATCTG AMTCCTCAA TACGTGTAGG CTGAATACAC AGCCAGGGAA GAAGGTATTT 1140 

TATCCAGTTC TTTTCAGTCA QTACAATCCT GGCATAATAT ACGGCCACCA TGATGCAGTC 1200 

CCTCCCTTGG AACAGCAGCT GGTCATAAAG AAGGAAACTG GATTTTGGAG AGACTTTGGA 1260 

TTTGGGATGA CGTGTCAGTA TCGGTCAGAC TTCATCAATA TAGGTGGGTT TGATCTGGAC 1320 

ATCAAAGGCT GGGGCGGAGA GGATGTGCAC CTTTATCGCA AGTATCTCCA CAGCAACCTC 1380 

ATAGTGGTAC GGACGCCTGT GCGAGGACTC TTCCACCTCT GGCATGAGAA GCGCTGCATG 1440 

GACGAGCTGA CCCCCGAGCA GTACAAGATG TGCATGCAGT CCAAGGCCAT GAACGAGGCA 1500 

TCCCACGGCC AGCTGGGCAT GCTGGTGTTC AGGCACGAGA TAGAGGCTCA CCTTCGCAAA 1560 
CAGAAACAGA AGACAAGTAG CAAAAAAACA TGA 

gEq |D N0;206 PAJ3 Prpigin seqgence; 



U.PRANSPT GKEGYQAVLQ 
EWEEQHRNYV SSLKRQIAOL KEELQERSEQ LRNGQYQASD AAGLGLDRSP PEKTQADLLA 
PLHSQVDKAE VNAGVKLATE YAAVPFDSFT LQKVYQLETG LTRHPEEKPV RKDKHDELVE 
AIESALETLN NPAENSPNHR PYTASDFIEG IYRTERDKGT LYE LTFKGDH KHEFKRLILF 
RPFGPIMKVK NEKLNMAHTL INVIVPLAKR VDKFRQFMQN FREMCIEQDG RVHLTWYFG 

KEEINEVKGI LEBTSKAANF RNFTFIQLNG EFSRGKGLDV GARFWKG 

FTSEFLNTCR LNTQPGKKVF YPVLFSQYNP GIIYGHHDAV PPLEQQLVIK K 
FGJTTCQYRSD FINIGGFDLD IKGWGGEDVH LYRKYLHSNL IWRTP\ 
DELTPEQYKM CMQSKAMNEA SHGQLGMLVF RHEIEAHLRK QKQKTSSKKT 

SEQ ID NO:207 PAJ5 DNA SEQUENCE 

Nucleic Acid Accession*: AF189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGATTCCIG TATTGACATC AAAAAAAGCA AGTGAATTAC CAGTCAGTGA AGTTGCAAGC 
ATTCTCCAAG CTGATCTTCA GAATO3TCTA AACAAATGTG AAGTTAGTCA TAGGCGAGCC 
TTTCATGGCT GGAATGAGTT TGATATTAGT GAAGATGAGC CACTGTGGAA GAAGTATATT 
TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCAGTCAT CAGTGTTTTA 
ATGCATCAGT TTGATGATGC CGTCAGTATC ACTGTGGCAA TACTTATCGT T 1 
GCCTTTGTTC AGGAATATCG TTCAGAAAAA TCTCTTGAAG AATTGAC 



480 



GAGGCTGTGG ATCTTTCCAT TGATGAGTCC AGCTTGACAG GTGAGACAAC GCCTTGTTCT 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATCGAG AAGTAACATT 600 

GCCTTTATGG GAACACTGGT CAGATGTGGC AAAGCAAAGG GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGGTA-PA 780 

ATAGGAATCA TCATGTTGGT TGGCTGGTTA CTGGGAAAAG ATATCCTGGA AATGTTTACT 840 

ATTAGTGTAA GTTTGGCTOT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900 

- _ ACGCTAGCTC TTGGTGTTAT GAGAATGGTG AAGAAAAGGG CCATTGTGAA AAAGCTGOCT 960 

OD ATTGTTGAAA CTCTGGGCTG CTGTAATGTG ATTTGTTCAG ATAAAACW3G AACACTGACG 1020 

AAGAATGAAA TGACTGTTAC TCACATATTT ACTTCAGATG GTCTGCATGC TGAGGTTACT 1080 

GGAGTTGGCT ATAATCAATT TGGGGAAGTG ATTGTTGATG GTGATGTOGT TCATGGATTC 1140 

TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTGTG TGTGCAATGA TGCTGTAATT 12 00 

AGAAACAATA CTCTAATGGG GAAGCCAACA GAAGGGGCCT TAATTGCTCT TGCAATGAAG 1260 
7U ATGGGTCTTG ATGGACTTCA ACAAGACTAC AT 



T TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAGGCACGC 1500 

ATGGGCTCAG CGGGACTCAG AGTTCTTGCT TTGGCTTCTO GTCCTGAACT GGGACAGCTG 1560 

ACATTTCTTG GCTTGGTGGG AATCATTGAT CCACCTAGAA CTGGTGTGAA AGAAGCTGTT 1620 

ACAACACTCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGGAGACT 1680 

GCAGTTGCAA TCGCCAGTCG TCTGGGATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740 

GAAGAAATAG ATGCAATGGA TGTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 18 00 

TTTTACAGAG CTAGCCCAAG GCACAAGATG AAAATTATTA AGTCGCTACA GAAGAACGGT 1860 

TCAGTTGTAG CCATGACAGG AGATGGAGTA AATGATGCAG TTGCTCTGAA GGCTGCAGAC 1920 



387 



WO 02/30268 



PCT/US01/32045 



ATTGGAGTTG CGATGGGCCA GACTGGTACA GATGTTTGCA AAGAGGCAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGGGTAA AGGGATTTAT 2040 

AATAACATTA AAAATTTCGT TAGATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100 

ATCTCATTGG CTACATTAAT GAACTTTCCT AATCCTCTCA ATGCCATGCA GATTTTGTGG 2160 

ATCAATATTA TTATGGATGG ACCCCCAGCT CAGAGCCTTG GAGTAGAACC AGTGGATAAA 2220 

GATGTCATTC GTAAACCTCC TCGCAACTGG AAAGACAGCA TTTTGACTAA AAACTTGATA 2280 

CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CTTTGTTTGT CTTCTGGCGT 2340 

GAGCTACGAG ACAATGTGAT TACACCTCGA GACACAACAA TGACCTTCAC ATGCTTTGTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAGA CCAAGTCTGT GTTTGAGATT 2460 

GGACTCTGCA GTAATAGAAT GTTTTGCTAT GCAGTTCTTG GATCCATCAT GGGACAATTA 2520 

CTAGTTATTT ACTTTCCTCC GCTTCAGAAG GTTTTTCAGA CTGAGAGCCT AAGCATACTG 2580 

GATCTGTTGT TTCTTTTGGG TCTCACCTCA TCAGTGTGCA TAGTGGCAGA AATTATAAAG 2640 

AAGGTTGAAA GGAGCAGGGA AAAGATCCAG AAGCATGTTA GTTCGACATC ATCATCTTTT 2700 
CTTGAAGT AT GA 

SESJB NQ:2Q8 PAJ5 Protein SSmm 



I I I I 

k SEMVSEVAS ILQADLQNGL NKCEVSHRSA FHGWNEFDIS EDEPLWKKYI 
SQFKNPLIML LLASAVISVL MHQFDDAVSI TVAILIWTV AFVQEYRSEK SLEELSKLVP 
PECHCVREGK LEHTLARDLV PGDTVCLSVG DRVPADLRLF EAVDLSIDES SLTGETTPCS 
KVTAPQPAAT KGDLASBSNI AFMGTLVRCG KAKGW1GTG ENSEFGEVFK MMCAEEAPKT 
PLQKSMDLLG KQLSFYSFGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEGLPIWTV 

TLALGVMRMV KXRAIVKKLP IVETLGCCNV ICSDKTGTLT KNEMTVI 

GVGYNQFGEV 1VDGDWHGF YNPAVSRIVE AGCVCNDAVI RNNTLMG 
MGLDGLQQDY 1RKAEYPFSS EQKWMAVKCV HRTQQDRPEI CFMKGAYEQV I 

GQTLTLTQQQ RDVYQQEKAR MGSAGLRVLA LASGPELGQL TFLGLVG 

TTLIASGVSI KMITGDSQET AVAIASRLGL YSKTSQSVSG EEIDAMDVQQ LSQIVFKVAV 600 
FYRASPRHKM KIIKSLQKNG SWAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 660 
LVDDDFQTIM SAIEEGKGIY NNIKNFVRFQ LSTSIAALTL ISLATLMNFP NPLNAHQILW 720 
INIIMDGPPA QSLGVEPVDK DVIRKPPRNW KDSILTKNLI LK1LVSSIII VCGTLFVFWR 780 
ELRDNVITPR DTTMTFTCFV FFDMFNALSS RSQTKSVFEI GLCSNRMFCY AVLGSIMGQL 840 
LVIYFPPLQK VFQTESLSII. DLLFLLGLTS EVC1VAEIIK KVERSREKIQ KHVSSTSSSF 900 

SEQ ID NO:209 PAV4 VARIANT 1 DNA SEQUENCE 

Nucleic Acid Accession*: N62096 

Coding sequence: 1-1284 (underlined sequences correspond lo start and stop codons) 



1 11 21 31 41 51 

45 ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGGATTGCC OTATTCAATG . 60 

AAGCAAGCTG GGTTTCCTTT GGGAATATTG CTTTTATTCT GGGTTTCATA TGTTACAGAC 120 

TTTTCCCTTG TTTTATTGAT AAAAGGAGGG GCCCTCTCTG GAACAGATAC CTACCAGTCT 180 

TTGGTCAATA AAACTTTCGG CTTTCCAGGG TATCTGCTCC TCTCTGTTCT TCAGTTTTTG 240 

_ TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG GAGATACTTT GAGCAAAGTT 300 

50 TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTGTTTA TTGGTCGCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT TACTCTGCCT TTATCCTTGT ACCGAAATAT AGCAAAGCTT 420 

GGAAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCCAATGCCA TTCAAGCGGT CGGGGTTATG TCTTTTGCAT TTATTTGCCA CCATAACTCC 600 

55 TTCTTAGTTT ACAGTTCTCT AGAAGAACCC ACAGTAGCTA AGTGGTCCCG CCTTATCCAT 660 

ATGTCCATCG TGATTTCTGT ATTTATCTGT ATATTCTTTG CTACATGTGG ATACTTGACA 720 

TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATGA TGACCTGGTA 780 

ACATTTGGAA GATTTTGTTA TGGTGTCACT GTCATTTTGA CATACCCTAT GGAATGCTTT 840 

. GTGACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GGTTTTCCAC 900 

60 ATTGTTGTAA CAGTGATGGT CATCACTGTA GCCACGCTTG TGTCATTGCT GATTGATTGC 960 

CTCGGGATAG TTCTAGAACT CAATGGTGTG CTCTGTGCAA CTCCCCTCAT TTTTATCATT 1020 

CCATCAGCCT GTTATCTGAA ACTGTCTGAA GAACCAAGGA CACACTCCGA TAAGATTA'fG 1080 

TCTTGTGTCA TGCTTCCCAT TGGTGCTGTG GTGATGGTTT TTGGATTCGT CATGGCTATT 1140 

, _ ACAAATACTC AAGACTGCAC CCATGGGCAG GAAATGTTCT ACTGCTTTCC TGACAATTTC 1200 

65 TCTCTCACAA ATACCTCAGA GTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 1260 
ATTAGTATCT TTCAACTCGA GTAA 



gEQIDNO:210PAV4V?ri; 
Protein Accession*: 



GLSTVTFTLP LSLYRNIAKL GKVSLISTGL TTLILGIVMA RAISLGPHIP KTEDAWFAK 
PNAIQAVGVM SFAFICHHNS PLVYSSLEEP TVAKWSRLIH MSIVISVFIC IFFATCGYLT 
FTGFTQGDLF ENYCRNDDLV TFGRFCYGVT VILTYPMECF VTREVIANVF FGGNLSSVFH 
IWTVKVITV ATLVSLLIDC LGIVLELNGV LCATPLIFII PSACYLKLSE EPRTHSDKIM 
SCVMLPIGAV VMVFGFVMAI TNTQDCTHGQ EHFYCFPDNF SLTNTSESHV QQTTQLSTLN 



WO 02/30268 



PCT/USO 1/32045 



SEQ ID NO:2t1 PAV4 VARIANT 2 DNA SEQUENCE 

Nucleic Acid Accessions N62096 

Coding sequence: 1-1203 (undertined sequences correspond to slart and slop codons) 

1 11 21 31 41 51 

I I I I I 'I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGT TTTCCCTTGT TOTATTGATA 60 

AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGGTCAATAA AACTTTCGGC 120 

TTTCCAGGGT ATCTGCTCCT CTCTGTTCTT CAGTTTTTGT ATCCTTTTAT AGCAATGATA 180 

AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT TTCAAAGAAT CCCAGGAGTT 240 

GATCCTGAAA ACGTGTTTAT TGGTCGCCAC TTCATTATTG GACTTTCCAC AGTTACCTTT 3 00 

ACTCTGCCTT TATCCTTGTA CCGAAATATA GCAAAGCTTG GAAAGGTCTC CCTCATCTCT 3 60 

ACAGGTTTAA CAACTCTGAT TCTTGGAATT GTAATGGCAA GGGCAATTTC ACTGGGTCCA 420 

CACATACCAA AAACAGAAGA CGCTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 480 

GGGGTTATGT CTTTTGCATT TATTTGCCAC CATAACTCCT TOTTAGTTTA CAGTTCTCTA 540 

GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT GATTTCTGTA 600 

TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT CACCCAAGGG 660 

GACTTATTTG AAAATTACTG CAGAAATGAT GACCTGGTAA CATTTGGAAG ATTTTGTTAT 720 

GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA GGTAATTGCC 780 

AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 

ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 900 

AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960 

CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 

GGTGCTGTGG TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCACC 1080 

CATGGGCAGG AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA TACCTCAGAG 1140 

TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200 



3 FFGYLLLSVL QFLYPFIAMI 



Nucleic Acid Accession*: 



1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 
CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 
TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 
CCTGAAAACG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 
CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 
GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT G 
ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA A 
GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480 
GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540 
ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600 
TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 
GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720 
GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 
ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTCCCTCG GGATAGTTCT AGAACTCAAT 840 
GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 
TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 
GCTGTGGTGA TGGTTTTTGG ATTCGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 
GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA 
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MGYQRQEFVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNI1AGDTLS KVFQRIPGVD 60 

PENVFIGRHF IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAISLGPH 120 

IPKTEDAWVF AKPNAI QAVG VMSFAFICHH NSFWYSSLE EPTVAKWSRL IHMSIVISVF 180 

ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240 

VFFGGNLSSV FHIWTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIFSACYLKL 300 

SEEPRTHSDK IMSCVMLPIG AWMVFGFVM AITNTQDCTH GQEMFYCFPD HFSLTNTSES 360 
HVQQ1TQLST LNISIFQLE 

SEQ ID HO:215 PAV4 VARIANT4 DNA SEQUENCE: 

Nucleic Acid Accession*: N6209S 

Coding sequence: 1-1388 (underlined sequences correspond to start and stop cottons) 

l 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGATTTAGA TCACAGAGAA 60 

ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120 

GTTGTCAACT CGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180 

GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACAGA CTTTTCCCTT 240 

GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300 
3 GCTTTCCAGG G 



ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 540 

TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTGGAA TTGTAATCGC AAGGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG TATTTGCAAA GCCCAATGCC 660 

ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 720 

TACAGTTCTC TAGAAGAACC CACAGTAGCT AAGTGGTCCC GCCTTATCCA TATGTCCATC 780 

GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC ATTTACTGGC 840 

TTCACCCAAG GGGACTTATT TGAAAATTAC TGCAGAAATG ATGACCTGGT AACATTTGGA 900 

AGATTTTGTT ATCGTGTCAC TGTCATOTTG ACATACCCTA TGGAATGCTT TGTGACAAGA 960 

GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCCA CATTGTTGTA 1020 

ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 1080 

GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC 1140 

TGTTATCTGA AACTGTCTGA AGAACCAAGG ACACACTCCG ATAAGATTAT GTCTTGTGTC 1200 

ATGCTTCCCA TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT TACAAATACT 1260 

CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 1320 

AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380 
TTTCAATGA 

SEQ ID NO:216 PAV4 Variant 4 Protein seoufince: 



1 11 21 31 41 51 

I I I I I I 

MGYQRQEPVI PPQRDLDDRE TLVSEHEYKE KTCQSAALFN WNSIIGSGI IGLPYSMKQA 60 

GFPLGILLLF WVSYVTDFSL VLLIKGGALS GTDTYQSLVN KTFGFFGYLL LSVLQFLYPF 120 

IAMISYNIIA GDTLSKVFQR IPGVDPENVF IGRKFIIGLS TVTFTLPLSL YRNIAKLGKV 180 

SLISTGLTTL ILGIVMARAI SLGPHIPKTE DAWVFAKPNA IQAVGVMSFA FICHHNSFLV 240 

YSSLEEPTVA KWSRLIHMSI VISVFICIFF ATCGYLTFTG FTQGDWENY CRNDDLVTFG 300 

RFCYGVTVIL TYPMECFVTR EVIANVFFGG NLSSVFHIW TVMVITVATL VSLLIDCLGI 360 

VLEUJGVLCA TPLIFIIPSA CYLKLSEEPR THSDKIMSCV MLPIGAWMV FGFVMAIWJT 420 
QDCTHGQEMF YCFPDNFSLT NTSESHVQQT TQLSTLNISI FQ 

SEQ ID NO:217 PAV9 DNA SEQUENCE 

Nucleic Acid Accession*: NM_017636 

Coding sequence: 1-3501 (underlined sequences correspond to start and stop codons) 



CCTTCGGGGC AGCCGTGGTG ACCGTCTGGG ACAGCGATGC ACACACCACG 

AATTTCCTCC GGCTCTCTCA CCGAACGGAT CCAGCTGCAG TTTATAGTCT GGTCACACGC 
ACATGGGGCT TCCGTGCCCC GAACCTGGTG GTGTCAGTGC TGGGGGGATC GGGGGGCCCC 
GTCCTCCAGA CCTGGCTGCA GGACCTGCTG CGTCGTGGGC TGGTGCGGGC TGCCCAGAGC 
ACAGGAGCCT GGATTGTCAC TGGGGGTCTG CACACGGGCA TCGGCCGGCA TGTTGGTGTG 
GCTGTACGGG ACCATCAGAT GGCCAGCACT GGGGGCACCA AGGTGGTGGC CATGGGTGTG 
3 GTGTGGTCCG GAATAGAGAC ACCCTCATCA ACCCCAAGGG CTCGTTCCCT 
C GGTGGCGCGG TGACCCGGAG GACGGGGTCC AGTTTCCCCT GGACTACAAC 
TACTCGGCCT TCTTCCTGGT GGACGACGGC ACACACGGCT GCCTGGGGGG 0 
TTCCGCTTGC GCCTGGAGTC CTACATCTCA CAGCAGAAGA CGGGCGTGGG A 
ATTGACATCC CTGTCCTGCT CCTCCTGATT GATGGTGATG AGAAGATGTT GACGCGAATA 
GAGAACGCCA CCCAGGCTCA GCTCCCATGT CTCCTCGTGG CTGGCTCAGG GGGAGCTGCG 
GACTGCCTGG CGGAGACCCT GGAAGACACT CTGGCCCCAG GGAGTGGGGG AGCCAGGCAA 
GGCGAAGCCC GAGATCGAAT CAGGCGTTTC TTTCCCAAAG GGGACCTTGA GGTCCTGCAG 
GCCCAGGTGG AGAGGATTAT GACCCGGAAG GAGCTCCTGA CAGTCTATTC TTCTGAGGAT 
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CTCATGGACG CCCTGCTGAA TGACCGGCCT G 
CTCAGCCTGG GCCACTTCCT GACCCCGATG CGCCTGGCCC A 
TCCAACTCGC TCATCCGCAA C 
CCAGCCCTAA A 

ATGCTGCTGG GGAAGATGTG CGCGCCGAGG T 
CCAGGCCAGG GCTTCGGGGA GAGCATGTA 



GCAGCACGGA GGAAAGACCT GGCGTTCAAG TTTGAGGGGA T 
GAGTGCTATC GCAGCAGTGA GGTGAGGGCT GCCCGCCTCC T 

TGGGGGGATG CCACTTGCCT CCAGCTGGCC ATGCAAGCTG ACGCCCGTGC CTTCTTTGCC 1860 

CAGGATGGGG TACAGTCTCT GCTGACACAG AAGTGGTGGG GAGATATGGC CAGCACTACA 192 0 

CCCATCTGGG CCCTGGTTCT CGCCTTCTTT TGCCCTCCAC TCATCTACAC CCGCCTCATC 1980 

ACCTTCAGGA AATCAGAAGA GGAGCCCACA CGGGAGGAGC TAGAGTTTGA CATGGATAGT 2040 

GTCATTAATG GGGAAGGGCC TGTCGGGACG GCGGACCCAG CCGAGAAGAC GCCGCTGGGG 2100 

GTCCCGCGCC AGTCGGGCCG TCCGGGTTGC TGCGGGGGCC GCTGCGGGGG GCGCCGGTGC 2160 



2340 



CTGCGCCAGG GCCTGAGCGG AGGCGGGGGC AGCCTCGCCA GCGGGGGCCC CGGGCCTGGC 2400 

CATGCCTCAC TGAGCCAGCG CCTGCGCCTC TACCTCGCCG ACAGCTGGAA CCAGTGCGAC 2460 

CTAGTGGCTC TCACCTGCTT CCTCCTGGGC GTGGGCTGCC GGCTGACCCC GGGTTTGTAC 252 0 

CACCTGGGCC GCACTGTCCT CTGCATCGAC TTCATGGTTT TCACGGTGCG GCTGCTTCAC 2580 

ATCTTCACGG TCAACAAACA GCTGGGGCCC AAGATCGTCA TCGTGAGCAA GATGATGAAG 2640 

GACGTGTTCT TCTTCCTCTT CTTCCTCGGC GTGTGGCTGG TAGCCTATGG CGTGGCCACG 2700 

GAGGGGCTCC TGAGGCCACG GGACAGTGAC TTCCCAAGTA TCCTGCGCCG CGTCTTCTAC 2760 

CGTCCCTACC TGCAGATCTT CGGGCAGATT CCCCAGGAGG ACATGGACGT GGCCCTCATG 2820 

GAGCACAGCA ACTGCTCGTC GGAGCCCGGC TTCTGGGCAC ACCCTCCTGG GGCCCAGGCG 2880 

GGCACCTGCG TCTCCCAGTA TGCCAACTGG CTGGTGGTGC TGCTCCTCGT CATCTTCCTG 2940 

CTCGTGGCCA ACATCCTGCT GGTCAACTTG CTCATTGCCA TGTTCAGTTA CACATTCGGC 3000 

AAAGTACAGG GCAACAGCGA TCTCTACTGG AAGGCGCAGC GTTACCGCCT CATCCGGGAA 3060 

TTCCACTCTC GGCCCGCGCT GGCCCCGCCC TTTATCGTCA TCTCCCACTT GCGCCTCCTG 3120 

CTCAGGCAAT TGTGCAGGCG ACCCCGGAGC CCCCAGCCGT CCTCCCCGGC CCTCGAGCAT 3180 

TTCCGGGTTT ACCTTTCTAA GGAAGCCGAG CGGAAGCTGC TAACGTGGGA ATCGGTGCAT 3240 

AAGGAGAACT TTCTGCTGGC ACGCGCTAGG GACAAGCGGG AGAGCGACTC CGAGCGTCTG 3300 

AAGCGCACGT CCCAGAAGGT GGACTTGGCA CTGAAACAGC TGGGACACAT CCGCGAGTAC 3360 

GAACAGCGCC TGAAAGTGCT GGAGCGGGAG GTC CAGCAGT GTAGCCGCGT CCTGGGGTGG 342 0 

GTGGCCGAGG CCCTGAGCCG CTCTGCCTTG CTGCCCCCAG GTGGGCCGCC ACCCCCTGAC 3480 
CTGCCTGGGT CCAAAGAC TG A 

SEQ ID N0:218 PAV9 Protein sequence: 



AVEDHQMAST GGTKWAMGV APWGWRNRD TLINPKGSFP ARYRWRGDPE DGVQFPLDYN 180 

YSAFFLVDDG THGCLGGENR FRLRLESYIS QQKTGVGGTG IDIPVLLLLI DGDEKMLTRI 240 

ENATQACLPC U.VAGSGGAA DCLAETLEDT LAPGSGGARQ GEARDRIRRF FPKGDLEVLQ 300 

AQVERIMTRK ELLTVYSSED GSEEFETIVL KALVKACGSS EASAYLDELR LAVAWNRVDI 360 

AQSEtiFRGDI QWRSFHLEAS IMDAiLNDRP EFVRLLISHG LSLGHFLTPH RLAQLYSAAP 420 

SNSLIRNLLD QASEtSAGTKA PALKGGAAEL RPPDVGHVLR MLLGKHCAPR YPSGGAWDPH 480 

PGOGFGESMY LLSDKATSPL 5LDAGLGQAP WSDLLLWALL LNRAQMflMYF WEMGSNAVSS 540 

ALGACLLLRV HARLEPDAEE AARRKOLAFK FEGMGVDLFG ECYRSSEVRA ARLLLRRCPL 600 

WGDATCLQLA MQADARAFFA QDGVQSLLTQ KWWGDMASTT PIWALVLAFF CPPLIYTRIJ 660 

TFRKSEEEPT REELEFDMDS VINGEGPVGT ADPAEKTPLG VPRQSGRPGC CGGRCGGRRC 720 

LRRWFHFWGA FVTIFMGNW SYIXFLLLFS RVLLVDFQPA PPGSLELLLY FWAFTLLCEE 780 

LRQGLSGGGG SLASGGPGPG HASLSQRLRL YLADSWNQCD LVALTCFLLG VGCRLTPGLY 840 

HLGRTVLCID FMVFTVRLLH IFTVNKQLGP KIVIVSKMMK DVFFFLFFLG VWLVAYGVAT 900 

EGLLRPRDSD FPS1LRRVFY RPYLQIFGQI PQEDMDVALM EHSNCSSEPG FWAHPPGAQA 960 

GTCVSQYANW LWLLLVIFL LVANILLVNL LIAMFSYTFG KVCGNSDLYW KAQRYRLIRE 1020 

FHSRPALAPP FIVISHLRLL LRQLCRRPRS PQPSSPALEH FRVYLSKEAE RKLLTWESVH 1080 

KENFLLARAR DKRESDSERL KRTSQKVDLA LKQLGHIREY EQRLKVLERE VQQCSRVLGW 1140 
VAEALSRSAL LPPGGPPPPD LPGSKD 

SEQ ID N0:219 PBF1 DNA SEQUENCE 

Nucleic Acid Accession* AA054237 

Coding sequence: 1 -894 (underlined sequences correspond lo start and stop codons) 



CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGJ 
GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG 
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CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CCCCCGCTGG GGCGCCGGCT GCTCCCGGGC 
GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CGCTCGCTCC TGGGGCTCGG CGGGCTGGAC 
GCCGAGTGCG GCCGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 
CTGGGCATCG ACCG GG AC AT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 
T ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC 




ATTAGCCGGA CCAAGATTGC 



SEQ ID NO:220FBF1Prp1ein sequence: 



: WYETDPRRHK ESCERSRAGA DPPDQKNRLM 
PLSHLPLRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AECGRPLFAT YSGLWRKCYF 
LGIDRDIDTL ILKGIAQRCT AIKYHFSQPI RLRNIPFNLT KTIQQDEWHL LHLRRITAGF 



SEQ ID NO:221 PCI4DNA SEQUENCE 

NM_016S70 

1- 1 134 {undefined sequences correspond lo start and stop codons) 



AGTTTGGTAA AAGAGTTGGA TGCCTTTCCG 



TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTCAG TATATCAAGA TACATGGATG 180 

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300 

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360 

AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420 

CAAGATGTGA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 480 

GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA 540 
GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC C 
CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT C 
TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG A 



TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 840 
CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 
ATGGTGACAG TTACTGAGGA GCACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 
50 ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAGTT 1020 
GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTMAGAAA ATAATACACA TTGA 

55 

Protein Access 

1 11 21 31 41 51 

OU MRRLNRKKTL SLVKELDAFP KVPESYVETS ASGGTVSLIA FTTMALLT1M EFSVYQDTWM 
KYEYEVCKDF SSKLRIKIDI TVAMKCQYVG ADVLDLAETM VASADGLVYE PTVFDLSPQQ 
KEWQHMLQLI QSRLQEEHSL QDVIFKSAFK STSTALPPRE DDSSQSPNAC RIHGHLYVNK 
VAGNFH ITVG KAIPHPRGHA HLAALVNHES YMFSHRIDHL SFGELVPAII NPLDGTEKIA 
IDHNQMFQYF 1TWPTKLHT YKISADTHQF SVTERERIIN HAAGSHGVSG IFMKYDLSSL 

65 MVTVTEEHMP FWQFFVRLCG IVGGIFSTTG MLHGIGKFIV EIICCRFRLG SYKPVNSVPF 



SEQ ID NO:223 PEZ3 DNA SEQUENCE 

Nucleic Acid Accession*: NM_001935.1 

Coding sequence: 76-2301 (underlined sequences correspond to start and stop codons) 



CGCGCGTCTC CGCCGCCCGC GTGACTTCTG CCTGCGCTCC TTCTCTGAAC GCTCACTTCC 
GAGGAGACGC CGACGATGAA GACACCGTGG AAGATTCTTC TGGGACTGCT GGGTGCTGCT 
GCGCTTGTCA CCATCATCAC CGTGCCCGTG GTTCTGCTGA ACAAAGGCAC AGATGATGCT 
ACAGCTGACA GTCGCAAAAC TTACACTCTA ACTGATCACT TAAAAAATAC TTATAGACTG 
AAGTTATACT CCTTAAGATG GATTTCAGAT CATGAATATC TCTACAAACA AGAAAATAAT 
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T TCAATGCTGA ATATGGAAAC AGCTCAGTTT TCTTGGAGAA CAGTACATTT 360 

GATGAGTTTG GACATTCTAT CAATGATTAT TCAATATCTC CTGATGGGCA GTTTATTCTC 420 

TTAGAATACA ACTACGTGAA GCAATGGAGG CATTCCTACA CAGCTTCATA TGACATTTAT 480 

GATTTAAATA AAAGGCAGCT GATTACAGAA GAGAGGATTC CAAACAACAC ACAGTGGGTC 540 

ACATGGTCAC CAGTGGGTCA TAAATTGGCA TATGTTTGGA ACAATGACAT TTATGTTAAA 500 

ATTGAACCAA ATTTACCAAG TTACAGAATC ACATGGACGG GGAAAGAAGA TATAATATAT 660 

AATGGAATAA CTGACTGGGT TTATGAAGAG GAAGTCTTCA GTGCCTACTC TGCTCTGTGG 720 

TGGTCTCCAA ACGGCACTTT TTTAGCATAT GCCCAATTTA ACGACACAGA AGTCCCACTT 780 

ATTGAATACT CCTTCTACTC TGATGAGTCA CTGCAGTACC CAAAG AC TGT ACGGGTTCCA 840 

TATCCAAAGG CAGGAGCTGT GAATCCAACT GTAAAGTTCT TTGTTGTAAA TACAGACTCT 900 

CTCAGCTCAG TCACCAATGC AACTTCCATA CAAATCACTG CTCCTGCTTC TATGTTGATA 960 

GGGGATCACT ACTTGTGTGA TGTGACATGG GCAACACAAG AAAGAATTTC TTTGCAGTGG 1020 

CTCAGGAGGA TTCAGAACTA TTCGGTCATG GATATTTGTG ACTATGATGA ATCCAGTGGA 1080 

AGATGGAACT GCTTAGTGGC ACGGCAACAC ATTGAAATGA GTACTACTGG CTGGGTTGGA 1140 

AGATTTAGGC CTTCAGAACC TCATTTTACC CTTGATGGTA ATAGCTTCTA CAAGATCATC 1200 

AGCAATGAAG AAGGTTACAG ACACATTTGC TATTTCCAAA TAGATAAAAA AGACTGCACA 1260 

TTTATTACAA AAGGCACCTG GGAAGTCATC GGGATAGAAG CTCTAACCAG TGATTATCTA 1320 

TACTACATTA GTAATGAATA TAAAGGAATG CCAGGAGGAA GGAATCTTTA TAAAATCCAA 1380 

CTTATTGACT ATACAAAAGT GACATGCCTC AGTTGTGAGC TGAATCCGGA AAGGTGTCAG 1440 



GACAATTCAG CTTTGGATAA AATGCTGCAG AATGTCCAGA TGCCCTCCAA AAAACTGGAC 1620 

TTCATTATTT TGAATGAAAC AAAATTTTGG TATCAGATGA TCTTGCCTCC TCATTTTGAT 1680 

AAATCCAAGA AATATCCTCT ACTATTAGAT GTGTATGCAG GCCCATGTAG TCAAAAAGCA 1740 

GACACTGTCT TCAGACTGAA CTGGGCCACT TACCTTGCAA GCACAGAAAA CATTATAGTA 1800 

GCTAGCTTTG ATGGCAGAGG AAGTGGTTAC CAAGGAGATA AGATCATGCA TGCAATCAAC 1860 

AGAAGACTGG GAACATTTGA AGTTGAAGAT CAAATTGAAG CAGCCAGACA ATTTTCAAAA 1920 

ATGGGATTTG TGGACAACAA ACGAATTGCA ATTTGGGGCT GGTCATATGG AGGGTACGTA 1980 

ACCTCAATGG TCCTGGGATC GGGAAGTGGC GTGTTCAAGT GTGGAATAGC CGTGGCGCCT 2040 

GTATCCCGGT GGGAGTACTA TGACTCAGTG TACACAGAAC GTTACATGGG TCTCCCAACT 2100 

CCAGAAGACA ACCTTGACCA TTACAGAAAT TCAACAGTCA TGAGCAGAGC TGAAAATTTT 2160 

AAACAAGTTG AGTACCTCCT TATTCATGGA ACAGCAGATG ATAACGTTCA CTTTCAGCAG 2220 

TCAGCTCAGA TCTCCAAAGC CCTGGTCGAT GTTGGAGTGG ATTTCCAGGC AATGTGGTAT 2280 

ACTGATGAAG ACCATGGA AT AGC TAGCAGC ACAGCACACC AACATATATA TACCCACATG 2340 

AGCCACTTCA TAAAACAATG TTTCTCTTTA CCTTAGCACC TCAAAATACC ATGCCATTTA 2400 

AAGCTTATTA AAACTCATTT TTGTTTTCAT TATCTCAAAA CTGCACTGTC AAGATGATGA 2460 

TGATCTTTAA AATACACACT CAAATCAAGA AACTTAAGGT TACCTTTGTT CCCAAATTTC 2520 

ATACCTATCA TCTTAAGTAG GGACTTCTGT CTTCACAACA GATTATTACC TTACAGAAGT 2580 

TTGAATTATC CGGTCGGGTT TTATTGTTTA AAATCATTTC TGCATCAGCT GCTGAAACAA 2640 

CAAATAGGAA TTGTTTTTAT GGAGGCTTTG CATAGATTCC CTGAGCAGGA TTTTAATCTT 2700 

TTTCTAACTG GACTGGTTCA AATGTTGTTC TCTTCTTTAA AGGGATGGCA AGATGTGGGC 2760 

AGTGATGTCA CTAGGGCAGG GACAGGATAA GAGGGATTAG GGAGAGAAGA TAGCAGGGCA 2820 
TGGCTGGGAA CCCAAGTCCA AGCATACCAA CACGAGCAGG C 
AGAAGAGCTG TTCACCACGA GACTGGCACA G 
CAGGAAATCA AATATCGAAA GCACTGACTT CTAAGTAAAC C 
AAAGAAATGT AAGGGAAACT GCCAGCJ 
TGCTACAAAA ACACAGCAAG GGTGATGGGA A 
TACTGATGTT CCTAGTGAAA GAGGCAGCTT GAAACTGAGA T 

CTGTTAAAAG ATGAAAATAT TTGTATCACA AATCTTAACT TGAAGGAGTC CTTGCATCAA 3240 

TTTTTCTTAT TTCATTTCTT TGAGTGTCTT AATTAAAAGA ATATTTTAAC TTCCTTGGAC 3300 

TCATTTTAAA AAATGGAACA TAAAATACAA TCTTATGTAT TATTATTCCC ATTCTACATA 3360 
CTATGGAATT TCTCCCAGTC ATTTAATAAA TCTGCCTTCA TTTTTTC 



55 $EQIDN0:224PEZ3Pr9telnse 



1 11 21 31 41 51 

I I I I I I 

MKTPWKIIAG LLGAAALVTI ITVPWLLNK GTDDATADSR KTYTLTDYLK NTYRLKLYSL 6( 

RWISDHEYIY KQENNILVFN AEYGNSSWL EHSTFDEFGH SINDYSISPD GQFILLEYNY 12( 

VKQWRHSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSFV GHKLAYVWNN DIYVKIEPNL 18( 

PSYRITWTGK EDIIYNGITD WVYEEEVFSA YSALWWSPNG TFLAYAQFND TEVPLIEYSF 24( 

YSDESLQYPK TVRVPYPKAG AVNPTVKFFV VNTDSLSSVT NATSIQITAP ASHLIGDHYL 30( 

CDVTWATQBR ISLQWLRRIQ NYSVMDICDY DESSGRWNCL VARQHIEMST TGWVGKFRPS 36{ 

EPHFTLDGNS FYKIISNEEG YRHICYFQ1D KKDCTFITKG TWEVIGIEAL TSDYLYYISN 42( 

EYKGMPGGEN LYKIQLIDYT KVTCLSCELN PERCQYYSVS FSKEAKYYQL RCSGPGLPLY 481 

TLHSSVNDKG LRVLEDHSAL DKMLQNVQMP SKKLDFIILN ETKFWYQMIL PPHFDKSKKY 541 

PLLLDVYAGP CSOKADTVFR LNVJATYLAST ENIIVASFDG RGSGYQGDKI MHAINRRLGT 601 

FEVEDQIEAA RQFSKMGFVD NKRIAIWGWS YGGYVTSMVL GSGSGVFKCG IAVAPVSRWE 661 

YYDSVYTERY MGLPTPEDNL DHYRNSTVMS RAENFKQVEY LLIHGTADDN VHFQQSAQIS 721 
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHFIK QCFSLP 

SEQ ID N0:225 PBJ2 DNA SEQUENCE 

Nucleic Acid Accession*; none found 

Coding sequence; 1-261 (underlined sequences correspond to start and stop codons) 
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ATGOCTCTGG CGAAGGTGAG GQAGCCAAAC GCAAATGACA ATGCCATCAG AGTTGACAAC 
AGAAGTGTGA TTAAAGTGCG TGCTAACCAG TGTTCCCTGC ATGAGGCAGA AAGTGAATCC 
AGAAACCCTC AGGAGCTCTG GATGGGCCTG CTCCTCTTGA TGGGGGTCCT AGAAGCATGT 
GTGGAAATGA GGCCTCTGTC AGTCTGGTCC CTGAGAGATG ACAAGGAGCA G. 
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MALAKVREPN ANDNAIRVDN RSVIKVRANQ CSLHEAESES RHPQELWMGL LLLMGVLEAC 
VEMRPLSVWS LRDDKEQSPH QPTLDV 

SEQID NO:227 PBM2 DNA SEQUENCE 



ATGCCAAATG CTGAGTTAGA AGCAAAGAGC CTTGGAAGCA GTAAATGTTT AAAAACTGCT 
CTCATACTTG CTGTATGTTG TGGATCAGCA AATATAGTCA GCCCTCTACT TGAGCAAAAT 
ATTGATGTAT CTTCTCAAGA TCTGGACAGA CGGCCAGAGA GTATGCTGTT TCTAGTCATC 
ATCATGTGGA CCAGTTTTGT GGAAGACAAT CTTTCCATGG GCTGGGGGAA GCTAGAAGAT 
TTTATGGCTA TTGAAGAAGA AATGAAGAAG CACGGAAGTA CTCATGTGGG ATTCCCAGAA 
AACCTGACTA ATGGTGCCGC TGCTGGCAAT GGTGATGATG GATTAATTCC TCCAAGGAAG 
AGCAGAACAC CK3AAAGCCA GCAATTTCCT GACACTGAGA ATGAAGAGTA TCACAGGTTT 
GTCAAAGATC AGATAGTTGT AGATATGCGG CGTTATTTC T GA 

SE(aiDHO:228PBM;Pr 9 tein??quenC0-. 



MPNAELEAKS LGSSKCLKTA LIIAVCCGSA NIVSPLLEQN IDVSSQDLDR RPESMLFLVI 
IMWTSFVEDN LSMGWGKLED FMAIEEEMKK HGSTHVGFPE NLTNGAAAGN GDDGLIPPRK 
40 SRTPESQQFP DTENEEYHRF VKDQIVVDMR RYF 

SEQ ID 110:229 PEZ2 DNA SEQUENCE 

Nucleic Acid Accession*: NMJH4253 

luence: 65-8242 (underlined sequences correspond lo start and stop codons) 



GACTGCTTGC ATTAAAGGAC TTCCTCATCC TTTTITTCAT GARACTGAGC TTGCTTAATC 60 

AGAGATGGAG CAAACTGACT GCAAACCCTA CCAGCCTCTA CCAAAAGTCA AGCATGAAAT 120 

GGATCTAGCT TACACCAGTT CTTCTGATGA GAGTGAAGAT GGAAGAAAAC CAAGACAGTC 180 

ATACAACTCC AGGGAGACCC TGCACGAGTA TAACCAGGAG CTGAGGATGA ATTACAATAG 240 

CCAGAGTAGA AAGAGGAAAG AAGTAGAAAA ATCTACTCAA GAGATGGAAT TCTGTGAAAC 300 

CTCTCACACT CTOTGCTCTG GCTACCAAAC AGACATGCAC AGCGITTCTC GGCATGGCTA 360 

CCAGCTAGAG ATGGGATCTG ATGTGGACAC AGAGACAGAA GGTGCTGCCT CACCTGACCA 420 



TGTGCAGAGC AGCCCACACA ACCAGTTCAC CTTCAGACCC CTCCCACCGC CACCTCCGCC 
TCCTCATGCC TGCACCTGTG CCAGGAAGCC ACCCCCTGCA GCGGACTCTC TTCAGAGGAG 
ATCAATGACT ACCCGCAGCC AGCCCAGCCC AGCTGCTCCA GCTCCCCCAA CCAGCACGCA 
GGATTCAGTC CAICTGCATA ACAGCTGGGT CCTGAACAGC AACATACCAT TGGAGACCAG 
GCATTCCCTG TTCAAACATG GATCTGGTTC CTCTGCGATC TTCAGTGCAG CCAGTCAGAA 
CTACCCTCTG ACATCCAATA CCGTGTACTC GCCCCCTCCC AGGCCTCTTC CTCGAAGCAC 
CTTTTCCCGA CCTGCCTTTA CCTTTAACAA ACCTTACAGG TGCTGCAACT GGAAGTGCAC 
AGCATTGAGC GCCACTGCAA TCACAGTGAC TTTGGCCTTG TTACTAGCCT 
AGTGCATTTG TTCGGCCTGA CTTGGCAGTT GCAACCAGTT GAAGGAGAGC 
TGGAGTTAGC AAAGGGAACA GGGGGACCGA GTCCATGGAC ACTACTTACT CTCCAATTGG 
AGGAAAAGTT TCTGATAAAT CAGAGAAAAA AGTGTTTCAG AAGGGACGGG CGATAGACAC 
TGGAGAAGTT GACATTGGTG CACAGGTCAT GCAGACCATT CCACCTGGTT TATTCTGGCG 
TTTCCAGATT ACTATCCACC ATCCAATATA TCTGAAGTTC AATATTTCTT TAGCCAAGGA 
CTCTCTGCTG GGAATTTATG GCAGAAGAAA CATTCCACCT ACACATACTC AGTTTGATTT 
TGTAAAACTA ATGGATGGCA AACAGCTGGT CAAGCAGGAC TCCAAGGGCT CTGATGATAC 
ACAGCACTCC CCTCGGAACC TGATCTTAAC TTCGCTTCAG GAGACAGGTT TCATAGAGTA 
TATGGATCAA GGACCTTGGT ATCTCGCGTT TTACAATGAT GGAAAAAAGA TGGAGCAAGT 
ATTCGTGTTA ACTACAGCAA TTGAAATAAT GGATGACTGT TCAACCAATT GCAATGGAAA 
TGGAGAGTGT ATCTCTGGCC ATTGTCATTG TTTCCCAGGA TTCCTTGGAC CTGACTGTGC 
TAGAGATTCC TGCCCTGTGC TGTGTGGTGG GAATGGAGAA TACGAGAAAG GACACTGTGT 
CTGCCGGCAT GGCTGGAAGG GGCCAGAGTC TGACGTTCCG GAAGAACAAT GCATTGATCC 
AACATGCTTT GGCCACGGCA CCTGCATCAT GGGAGTCTX3C ATCTGTGTGC CAGGATACAA 



WO 02/30268 



PCT/US01/32045 



AGGAGAAATA TGCGAGGAAG AGGACTGCCT AGACCCAATG TGTTCCAACC ATGGCATCTG 1980 

TGTAAAAGGA GAATGTCACT GTTCTACTGG CTGGGGAGGA GTTAACTGTG AAACACCACT 2040 

TCCTGTATGT. CAAGAGCAGT GCTCAGGACA CGGAACTTTT CTTCTGGACG CTGGAGTATG 2100 

CAGCTGTGAT CCCAAGTGGA CAGGATCTGA CTGCTCAACA GAGCTGTGTA CCATGGAGTG 2160 

TGGTAGCCAT GGAGTCTGCT CAAGAGGAAT TTGCCAGTGT GAAGAAGGCT GGGTAGGACC 2220 

AACATGTGAG GA&CGCTCCT GTCATTCTCA TTGTACTGAG CATGGCCAAT GCAAAGATGG 2280 

AAAATGTGAG TGTAGCCCTG GATGGGAGGG CGACCACTGC ACAATTGCTC ACTACTTAGA 2340 

TGCTGTCCGA GATGGCTGCC CAGGGCTCTG CTTTGGAAAT GGACGATGTA CCCTGGATCA 2400 

AAATGGTTGG CACTGTGTGT GTCAGGTGGG TTGGAGTGGG ACAGGCTGCA ATGTTGTCAT 2460 

GGAAATGCTT TGTGGAGATA ACTTGGACAA TGATGGAGAT GGTTTAACCG ACTGTGTGGA 2520 

TCCTGACTGT TGTCAACAAA GCAACTGTTA TATAAGTCCT CTCTGCCAGG GCTCACCAGA 2580 
TCCTCTTGAC CTCATTCAGC AAAGCCAAAC TCTCTTCTCT CAGCACACTT C 
TTATGATCGA ATCAAATTCC TCATTGGCAA GGACAGTACT CATGTC7 

GTCATTTGAC AGCAGGCGTG CCTGTGTGAT TCGAGGCCAA GTGGTGGCCA TAGATGGAAC 2760 

TCCTCTAGTG GGAGTQAATG TCAGTTTCTT GCACCACAGT GATTATGGGT TTACCATCAG 2820 

C CATCGGTGGC ATGTC TGTCA TCTTAATCTT 2880 
2 CCTTTCCTGC CTGAGAAGAG AACACTCTGG TTGCC TTGGA A 
TGTGGTAGAG AAAGTCACCA TGCAGAGAGT TGTATCAGAC CCGCCATCCT G 

V TCCTTCACCG CTCACATCAT TTGGAGGGTC 3060 



3120 

T ATAAAACCCT 3180 

GCTACGGATC CTTCTGACAC ATTCAACGAT TCCCGTAGGC ATGATAAAAG TACACCTCAC 3240 

AGTAGCTGTG GAAGGGCGAC TCACACAGAA GTGGTTTCCC GCCGCAATTA ATCTTGTCTA 3300 

CACATTTGCT TGGAACAAGA CCGATATCTA TCGACAGAAG GTTTGGGGCC TGGCAGAGGC 3360 

TTTGGTATCT GTGGGATATG AATATGAAAC GTGCCCTGAC TTTATTCTCT GGGAGCAAAG 3420 

GACAGTCGTT TTACAAGGTT TTGAGATGGA TGCTTCTAAC CTAGGAGACT GGTCTTTGAA 3480 

TAAGCATCAC ATTTTGAATC CTCAAAGTGG AATCATACAT AAAGGGAATG GAGAAAATAT 3540 

GTTCATTTCC CAGCAGCCCC CAGTCATATC AACCATAATG GGTAATGGAC ACCAAAGGAG 3600 

TGTAGCCTGC ACCAACTGCA ATGGCCCAGC CCACAACAAC AAACTCTTTG CTCCTGTCGC 3660 

CTTAGCTTCT GGCCCTGATG GCAGTGTGTA TGTTGGCGAC TTCAATTTTG TAAGGAGAAT 3720 

ATTTCCCTCG GGAAACTCCG TTAGTATTTT GGAATTAAGC ACAAGTCCTG CTCACAAATA 3780 

CTATCTGGCT ATGGACCCTC TGTCIGAATC ACTCTATCTA TCAGACACCA ATACTCGCAA 3840 

AGTCTACAAG TTGAAATCTC TTGTGGAGAC GAAAGATCTG TCCAAGAATT TTGAAGTGGT 3900 

GGCAGGAACT GGTGATCAGT GCCTTCCCTT TCACCAGAGT CATTGTGGAG ATGGTGGGAG 3960 

AGCATCGGAA GCTTCACTGA ATAGCCCTCG AGGCATCACA GTTGATAGGC ATGGATTTAT 4020 

TTACTTTGTG GATGGGACTA TGATTCGCAA AATTGATGAG AATGCTGTGA TCACAACTGT 4080 

AATCGGCTCA AATGGTCTGA CTTCCACACA ACCACTGAGC TGTGACTCAG GAATGGACAT 4140 

CACTCAGGTG CGATTAGAGT GGCCAACAGA CCTTGCAGTA AATCCTATGG ACAATTCATT 4200 

GTATGTCTTG GATAACAACA TTGTGCTGCA AATTTCTGAG AACAGGCGTG TTCGGATCAT 4260 
CGCAGGACGC CCCATTCACT GCCAGGTGCC AGGCATCGAT CATTTCCTGG TCAGCAAGGT 



A CCACCAATGG 



4380 



TGACTGTTTT TCAGGTGMG GTGGCTATGC CAAAGATGCA AAGA 
CTTAGCAGTG TCGCCTGATC GAACCCTCTA TGTGGCAGAC 
TACCATCAGC AGGAACCAAG 



CGCTGATCAG GAACTGTACC AGTTCACTGT AAATGGAACC CACCTACACA CCCTGAACTT 4740 

GATAACAAGG GACTATGTTT ATAACTTCAC CTACAATTCT GAAGGTGACT TGGGCGCGAT 4800 

TACCAGCAGC AATGGCAATT CAGTGCACAT TCGCCGTGAT GCAGGCGGAA TGCCGCTATC 4860 

3GCGGAC AAGTATACTG GCTGACTATA AGCAGCAATG GAGTCCTGAA 4920 

A GCCCAAGGCT ATAATCCGGC CTTAATGACC TATCCAGGAA ACACAGGGCT 4980 
TCTGGCTACC AAAAGTAACG AAAATGGATG GACAACCGTT TATGAGTATG A 



GAAGCTGACA AAAGTGGAGC TAGATACTIC CAACCGTGAA AATGTCCTCA TGTCAACCAA 5160 

CTTGACGGCA ACTAGTACCA TATATATTTT AAAACAAGAA AATACTCAAA GTACCTATCG 5220 

GGTGAATCCA GATGGTTCCC TGCGTGTCAC TTTTGCCAGC GGGATGGAGA TCGGCCTCAG 5280 

CTCAGAGCCC CACATCCTGG CAGGGGCAGT CAACCCTACC CTGGGCAAAT GCAACATCTC 1 5340 

ATTGCCCGGA GAGCACAATG CAAACCTCAT CGAGTGGCGG CAGAGGAAGG AGCAAAACAA 5400 

AGGCAATGTT TCGGCTTTTG AAAGGAGGCT GAGGGCCCAC AACAGAAACC TACTCTCCAT 5460 

AGATTTTGAT CATATAACCC GCACAGGAAA GATCTATGAT GACCATCGAA AATTCACCCT 5520 

TCGAATTCTT TATGACCAGA CTGGGCGACC CATTCTGTGG TCTCCTGTAA GCAGATATAA 5580 

TGAAGTGAAC ATCACATATT CACCTTCGGG ATTGGTGACG TTTATTCAAA GAGGAACGTG '5640 

;AATATG ACCAGAGTGG GAAAATTATT TCAAGAACTT GGGCTGATGG 5700 

G AGCTATACCT ACTTAGAAAA ATCTGTGATG CTTCTCCTAC ACAGCCAGCG 5760 

GCGTTACATC TTTGAGTATG ACCAATCAGA TTGCCTGCTG TCAGTTACCA TGCCTAGCAT 5820 

GGTGCGCCAC AGCTTACAAA CCATGCTTTC AGTGGGCTAC TACCGTAATA TCTACACCCC 5880 

ACCGGACAGT AGCACTTCTT TTATCCAAGA CTATAGTCGA GATGGCCGAT TGCTACAGAC 5940 

CCTGCATCTG GGGACAGGGC GCAGAGTCTT ATACAAGTAC ACCAAGCAAG CAAGGCTTTC 6000 

TGAGGTTCTC TATGATACCA CTCAGGTCAC ATTAACATAT GAAGAGTCTT CTGGAGTGAT 6060 

TAAGACAATA CACCTGATGC ATGACGGATT CATCTGCACA ATCAGATACA GGCAAACAGG 6120 

ACCTCTTATT GGACGCCAGA TTTTCAGATT CAGTGAAGAA GGCCTTGTGA ATGCACGGTT 6180 

CGACTACAGC TACAACAATT TCCGAGTCAC AAGCATGCAA GCTGTAATCA ATGAAACCCC 6240 

TTTGCCTATA GATCTTTACC GATATGTTGA TGTCTCTGGC AGAACAGAGC AGTTTGGAAA 6300 

ATTCAGTGTA ATTAATTACG ATTTAAATCA GGTCATAACT ACTACAGTGA TGAAACACAC 6360 

CAAAATCTTC AGTGCCAATG GACAAGTCAT TGAAGTCCAA TATGAAATCC TAAAGGCAAT 6420 

TGCCTACTGG ATGACCATTC AATATGATAA TGTGGGCCGA CATGGTAATA TGTGCATAAG 6480 

GGTAGGAGTA GATGCCAATA TAACAAGGTA CTTCTATGAA TACGATGCTG ATGGGCAACT 6540 

TCAGACTGTT TCTGTAAATG ACAAAACCCA GTGGCGTTAT AGTTACGATC TGAATGGAGA 6600 

CATCAACCTC TTAAGCCATG GGAAGAGTGC TCGTCTTACT CCTCTCCGAT ATGACCTCCG 6660 

AGACCGCATC ACCAGATTAG GAGAAATTCA GTATAAAATG GATGAAGATG GCTTTCTGAG 6720 
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GCAGAGGGGA AATGATATTT TTGAATATAA TTCTAATGGC CTCCTGCAGA AAGCCTACAA 6780 
TAAGGCTTCT GGCTGGACTG TGCAGTATTA CTATGATGGG CTTGGGCGAC GTGTCGCGAG 6840 
TAAGTCCAGC CTAGGGCAGC ACCTTCAGTT CTTTGTCGAC GCGACCGCGA ACCCCATAAG 6900 
AGTTACTCAT TTCTACAACC ACACAAGCTC GGAGATTACA TCTCTGTATT ATGATCTCCA 6960 
G AGTTAAGCAG TGGTGAAG 

7080 



7200 
7320 

TGTTGCAAAG TATACCACAG ACATCAGAAG TTGGTTGGAG CTATTTGGTT TCCAATTACA 7380 

CAATGTACTA CCTGGATTTC CCAAACCTGA ATTAGAAAAT TTAGAATTAA CTTACGAGCT 7440 

TCTACGGCTT CAGACAAAAA CTCAAGAGTG GGATCCTGGA AAGACTATCC TGGGCATTCA 7500 

GTGTGAACTC CAGAAACAGC TCAGGAATTT CATTTCCTTG GACCAACTAC CTATGACTCC 7560 

T GCCTTGAAGG AGGGAAGCAA CCAAGGTTTG CTGCTGTCCC 7620 

T GGGAAAGGTA TAAAATTTGC CATCAAGGAT GGCATAGTAA CAGCTGATAT 7680 

3 ATAGCAGGCG GCTTGCTGCC ATTCTCAATA ATGCCCATTA 7740 

CCTGGAAAAC CTACATTTTA CCATAGAGGG GAGGGACACT CACTACTTCA TTAAGCTTGG 7800 

GTCTCTGGAG GAAGACCTGG TGCTCATCGG TAACACTGGG GGGAGGCGGA TTCTGGAGAA 7860 

TGGTGTCAAT GTCACTGTGT CCCAGATGAC TTCTCTGTTG AATGGGAGGA CTAGACGGTT 7920 

TGCAGATATT CAGCTCCAGC ATGGAGCCCT GTGCTTCAAC ATCCGGTATG GGACAACTGT 7980 

CGAAGAGGAA AAGAATCACG TGTTGGAGAT TGCCAGACAG CGCGCAGTGG CCCAGGCCTC 8040 

GACTAAGGAA CAAAGAAGGC TGCAAGAGGG GGAAGAGGGG ATTAGGGCAT GGACAGAAGG 8100 

GGAAAAGCAG CAGCTTTTGA GOACTGGGCG GGTACAAGGT TACGATGGGT ATTTTGTTTT 8160 

GTCTGTTGAG CAGTATTTAG AACTTTCTGA CAGTGCCAAT AATATTCACT TTATGAGACA 8220 

GAGCGAAATA GGCAGGAG GT AAC AAAAATA 1CTCTGCCTT TGCGTCACCA AAGACTGCCT 8280 

GTTTTTAAAA CATAAAATGG TTTATTGTAT TGGTTTTCTA GATCAGAACT CTGTATATGT 8340 

AAATATGGAG GAAAAACATA TCCAACTGCC TTTCAATGTG ACGGAAGATG GTATCTTAAT 8400 

ATTGTTTGTT TAAACTCTTT AAGAAATGAC AGAGATTTTT AGTTCTTGTG TGGCAGTATT 8460 

CAAAATAACA CAAGTAGAAC TCAAACAGCT AAAAACAGTT TTCAGAAAGC ACCACTTTCA 8520 

ATTTGCCGAG CCATGCATAT GTTCCAATAT CCAGAAAGAA CCCAAGGTTC TCTATCTCTA 8580 

TTGTGAGAAG CAGTTTCATC CTTAACTGTT GGCAGAACTT ACGGGCTATT TGAATAGGTG 8640 

GTGCAATAGT ATCTGAAACT TGCCTTTCGA AAGACTGCCA GCCCTTTGAC GTTTTCCAGA 8700 

TCTGTTATAG GAAACTTAAA AACAGGTGTA AAATGTCTTC AGCCACCATC TCCTAGAGTG 8760 

AGGACCCAAT TGCCCTTCCT TCTTGATTAT TCCTCCTTCC TTGTTAAAGT AAATGCCATA 8820 

TGTGGTAACC AGACTGTATA GCCGCTATTT GCTCGTGTGT ACATGATACC AAAGCAGCTG 8940 
GCCAGCGTGA CCTCTCTCAC ACGACCTGTT TTGACTCAAT T 



AAGGGAAGAC CAGACCAAAC ATCACAGCAG TTGCTGCCAC ATTGTTTCAG CCCACTTAGA 9360 

ATCAAGTAAA TCCTTTCCAA CCGAAAACAT TTCAACTAAC TATAGAGAGG CAGACTCATT 9480 
TTTACTAAAA TAATTTATAC AGTTAGTTAT TTTCGTTCTC CGTACTTACC CATTTATCTT 9540 
TATTTAATCG TCTCTACTGC CTAGGAAAAT AACTATTTTC CAGGACGGGT TATTTGTTCT 9600 
GCGATCATTT AAAATTTGGA GAAAGGTCAG GATTAGTGTT AATATCAGCT GCAGTWCTC 9660 
AATCTCTAGG AATCCTGCAG TAAAACAAGC CCCTTGGTGA GCTGGAAGAT TTGTGCCCAG 9720 
TGACAAAGAG ATAGTTTGTA AAATGCTGTG TAATTGTAAG TTACCACAAA TGAAAATACA 9780 
TGACAGCACA ATGTGGCCCG TAGAAAATTC CCCTGAGCCA GCTTCTGCAC TTTCATCACC 9840 
GAATCTGAAC ATTTGCTATG TCTGAAGGCA AATTTATGAT GGAATGTTAG TTTGGATTCT 9900 
TTCCAGATGC TACCTAAATG CAGTGTGGGG TCATTGCCTT GCTTTGCGAT GACAGTTTCT 9960 
TTGAAAATAT GCAAAGTCAT AAGCTCATGT TAAGGTTTTT CAAGAGTCTG CCTCCTACTA 1 
CACAAAGGAA AGCAAGGGAA AGGAAATGAC CCTGGCAAAC AGTAGGGAAG GGK3TATTCA 1 
AACATTTCAT TTTCAAAACC TTCGGGTTAG AATACCACTT ACACATGTAT TCTGAGAGAC 1 
AGAATTCATG AGGAACTCAT CTCTCTTTAT AACTGGAAAC ACACCAGCTT GATATATTGC 1 
TAATCCATAC TAAAATCATA TTATTGGGTT TTTTCTGAAT CAGGCCTGTA TTAATGGTAC 1 
AGTATTTATT CAGAATGGAA TTCTAAAATT ACTAACAAAC TTGTTGAAAA TTTGAATACC 1 
TCCACACCAA CCTAAAAATG GACCTTAAGT TCCTAGAACC TCTGATGTTC TTTTAAATTA 1 
ATGGAAAAAT AATTTGTGAA CTGTATATAG AGAGTGCATT CATAAATGTG ATTATGTATT 1 
TTATCACAAA TCCAAAATGT CAATATTAGA GTCTATTTTG CTTATATTTT AAGCAATTAT 1 



GGAAATGCAC TTTTTATTAC TTACAGCTGT GGTTTTAATA CTGCCTTGAA CTATTATTAT 
TCTTTTTACA ACTCCTAAAG CTTGAGGGAG GAAAGAAAAA AAAAACAAAA CTACTAATCA 
GTAGTAAATC GAAGAGAAAC ATTTTGGCAT TTCTTAAGAA GAAGATGGAG ATATTGAGTA 
TATCACTTCC TATTCAGCTG AATAGAAAGA ATGCCTTCAT TGACTTGCAG TTCTGCAGTT 
TAAATTATTG AAAGAACAAT TCGTTTGCAT TTCCTGATGA AAGTAAAAGC ATTTTTCAGA 
GAAACATATG AATTTCTCAT ACCCAGCAGA CAGATGGCTG ACACTGCACA GCCACACACC 
ATTCGAGTAA GTTAAAGTGA GAGCATAGTA GTTGGACTCT CCTATCAAGA ACATTCTGGG 
CTGGAGGCAG GGAATACTCC ATGGTTGTTT CTTTTTCCTA CTTAAGCCCA TTTTGTTTGT 
GCTTTTCTGT TTTGTTTTGT TTTCACTCTT GCACTACAGT CTAGAGATCC AAATGAACTG 
AAAAGTTCAA AGTTTAACAC ATTTAAATAT GWTACTTTT AGTTGTCATT CTAATCGTTA 
TTGATTAGAA GCATGACTCC TGAAGGAAAG GGAAATAAAT CTCAATTCAT ACTAACTTGC 
AACAAAACAC TTTTACCATA TAAATAAGTA TATCATTTAT TTTTAACCCA AAAAATGTAT 
AAAATAAGTG TGTCCTTTAC TGTCAATTTA TCGAGAAGAT CTATAATATA TAGACTACAT 
ATATATAATA TATACAACAT AGCCAAATGT ATGAAAACTT GACAATGTAT AATTTGGAAT 



0020 
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25 
30 



50 
55 
60 
65 
70 



TCACATGCTA CCTATGTAGA CAGGTATGAA ATTAAGTTAT AATTTTCATG AGACATTTTC 11580 
ATCACTGTTG ACACAGTTTC AAGGCATTCC ATCATGTTAT TTTGACTCIT TTTCTTTTTT 11640 
TTTTCTOTAA AAATATATTT TTAACTAGAC CAGGCCCCAC TATAATATCA CTTAAGAGAG 11700 
TCAGGGCAAA GTTTTTGCAT TTATGAAGAT GTGTTCATGT AAGGGTGATT GTAATGGAGT 11760 
TCATTGGTAA TAGAAGCAAA AGTACAGTAA CGAAGTATTG AAAAGAAAAT TTTGGAGACA 11820 
TTGGAGCATA TTATATATAG CTTGTGGAAA GACATAAGGC TACAGATGGA A' 
CCTGTTTTCT TGAAGAAATT CACATACACA TAGCTGACCT GACTAGTACT T 
CACAGCCTTC TATAAAGGTT CITrCTTCTG CAAAGAAAAC AAAACAAAAC AAAACAAAAC 12000 
AAAAAAAAAC AAAAAAAGCG CAAAAAACAA AAAAACAAAA AAAAGCAAAG TAAAATTTAA 12060 



TGTGTCCTTC ATGTGAAAGC TATTAAGGAC CAAATATACT ACTGTTCATA AGAAGAAATT 12180 
ACTTTCTAAA CAGTAACTGA AAATACTTAG AGTTAAACTT GCTGTGGATT TTGTCTTGGC 12240 
AGTTGTCATC TTACATTATT TGTCAAAGGA AATGTGTTTG GCAGTTAAAA ATCTTTCCTT 12300 
AGATTTAGTG GTGGACTTTA ACCTCTTAAA TAAATGTTAG TATATCAGAT TGTGTCCTTG 12360 



12480 
12540 

GTAGASGTTC TAATCATATT GAAAATTGTG TTATAATGGT 12600 
CTGAGCCATG TTGCTAGGAA ACAATAGGTT CCAATTTTGT ATTCCTGCTC TCCTGTGCTG 12660 
20 AAAAGTGACT GGATACTGTA CAGGTTCATG TTCTCTGGCT GCAGTTAAAT GGTCTTTTGC 12720 
ATTTTGCTCT GGCTTTCAGG CCAGAAGCAT GCATTTTTCT ACAAGAGCAT CACAACAACA 12780 
TGCTGTAAAT ATTTAAAGTT AAACATTATG TGTTGATATT TGAAAGAAAA GTACTTTGAA 12840 
TATTTCATTT TTAAAAAATA AAATTGCCAA TGAAAAAAAA 



I I I I I I 

MEQTDCKPYQ PLPKVKHEMD LAYTSSSDES EDGRKPRQSY NSRETLHEYN QELRMNYNSQ 60 

SRKRKEVEKS TQEMEFCETS HTLCSGYQTD MHSVSRHGYQ. LEMGSDVDTE TEGAASPDHA 120 

LRMWIRGMKS EHSSCLSSRA NSALSLTDTD HERKSDGENG FKFSPVCCDM EAQAGSTQDV 180 

QSSFHNQFTF RPLPPPFPFP HACTCARKPP PAADSLQRRS MTTRSQPSPA AFAPPTSTQD 240 

SVHLHNSWVL NSNIPLETRH SLFKHGSGSS AIFSAASQNY PLTSNTVYSP PFRPLERSTF 300 

SRPAFTFNKP YRCCNWKCTA LSATAITVTL ALLLAYVIAV HLFGLTWQLQ PVEGELYANG 360 

VSKGNRGTES MDTTYSPIGG KVSDKSEKKV FQKGRA1DTG EVDIGAQVMQ TIPPGLFWRF 420 

QITIHHPIYL KFNISLAKDS LLGIYGRRNI PPTHTQFDFV KLKDGKQLVK QDSKGSDDTQ 480 

HSPRNLILTS LQETGFIEYM DQGPWYLAFY NDGKKMEQVF VLTTAIEIMD DCSTNCNGNG 540 

ECISGHCHCF EGFLGPDCAR DSCPVLCGGN GEYEKGHCVC RHGWKGPECD VPEEQCIDFT 600 

CFGHGTCIMG VCICVPGYKG EICEEEDCLD PMCSNHGICV KGECHCSTGW GGVNCETPLP 660 

VCQEQCSGHG TFLLDAGVCS CDPKWTGSDC STELCTMECG SHGVCSRGIC QCEEGWGPT 720 

CEERSCHSHC TEHGQCKDGK CECSPGWEGD HCTIAHYLDA VRDGCPGLCF GNGRCTLDQN 780 

GWHCVCQVGW SGTGCNWME MLCGDNLDND GDGLTDCVDP DCCQQSNCYI SPLCQGSPDP 840 

LDLIQQEQTL FSQHTSRLFY DRIKFLXGKD STHVIPPEVS FDSRRACVIR GQWAIDGTP 900 

LVGVNVSFLH HSDYGFTISR QDGSFDLVAI GGISVILIFD RSPFLPEKRT LWLPWNQFIV 960 

VEKVTHQRW SDPPSCDISN FISPNPIVLP SPLTSFGGSC PERGTIVPEL QWOEEIPIE 1020 

SSFVRLSYLS SRTPGYKTLL RILLTHSTIP VGMIKVHLTV AVEGRLTQKW F PAAINLVYT 1080 

FAWNKTDIYG QKVWGLAEAL VSVGYEYETC PDFILWEQRT WLQGFEMDA SNLGDWSLNK 1140 

HHILNPQSGI IHKGNGENMF ISQQPFVIST IMGNGHQRSV ACTNCNGPAH NNKLFAFVAL 1200 

ASGPDGSVYV GDFNFVRRIF PSGNSVSILE LSTSPAHKYY LAMDPVSESL YLSDTNTRKV 1260 

YKLKSiVETK DLSKNFEWA GTGDQCLPFD QSKCGDGGRA SEASLNSPRG ITVDRHGFIY 1320 

FVDGTMIRKI DENAVITTVI GSNGLTSTQP LSCDSGMDIT QVRLEWPTDL AWPMDHSLY 1380 

VLDNNIVLQI SENRRVRIIA GRPIHCQVPG IDHFLVSKVA IHSTLESARA ISVSHSGLLF 1440 

IAETDERKVN RIQCVTTNGE IYIIAGAPTD CDCKIDPHCD CFSGDGGYAK DAKMKAPSSL 1500 

r ISRNQAHLND MNIYEIASPA DQELYQFTVN GTHLHTLNLI 1560 

3NSVHIR RDAGGMPLWL WPGGQVYWL TISSNGVLKR 1620 

VEAQGYNPAL MTYPGNTGLL ATKSNENGWT TVYEYDPEGH LTNATFPTGE VSSFHSDLEK 1680 

LTKVELDTSN RENVLMSTNL TATSTIYILK QENTQSTYRV NPDGSLRVTF ASGMEIGLSS 1740 

EPHILAGAVN PTLGKCNISL PGEHNANLIE WRQRKEQNKG NVSAFERRLR AHNRNLLSID 1800 

FDHTTRTGKI YDDHRKFTLR ILYDQTGRPI LWSPVSRYNE VNITYSPSGL VTFIQRGTWN 1860 

EKMEYDQSGK IISRTWADGK IWSYTYLEKS VKLLLHSQRR YIFEYDQSDC LLSVTMPSMV 1920 

RHSLQTMLSV GYYRMIYTPP DSSTSFIQDY SRDGRLLQTL HLGTGRRVLY KYTKQARLSE 1980 

VLYDTTQVTL TYEESSGVIK TIHLMHDGFI CTIRYRQTGP LIGRQIFRFS EEGLVNARFD 2040 

YSYNNFRVTS MQAVINETFL PIDLYRYVDV SGRTEQFGKF SVINYDLNQV ITTTVMKHTK 2100 

IFSANGQVIE VQYEILKAIA YWMTIQYDNV GRHGNMCIRV GVDANITRYF YEYDADGQLQ 2160 

TVSVNDKTQW RYSYDLNGDI NLLSHGKSAR LTPLRYDLRD RITRLGEIQY KMDEDGFLRQ 2220 

RGNDIFEYNS NGLLQKAYNK ASGWTVQYYY DGLGRRVASK SSLGQHLQFF TOATANPIRV 2280 

THLYNHTSSE ITSLYYDLQG HLIAMELSSG EEYYVACDNT GTPLAVFSSR GQVIKEILYT 2340 

PYGDIYHDTY PDFQVI1GFH GGLYDFLTKt, VHLGQRDYDV VAGRWTTAYH HIWKQLNLLP 2400 

KPFNLYSFEN NYPVGKIQDV AKYTTDIRSW LELFGFQLHN VLPGFPKPEL ENLELTYELL 2460 

RLQTKTQEWD PGKTILGIQC ELQKQLRNFI SLDQLPMTPR YNDGRCLEGG KQPRPAAVPE 2520 

VFGKGIKFAI KDGIVTADII GVANEDSRKIj AAILNNAHYL ENLHFTIEGR DTHYFIKLGS 2580 

LEEDLVLIGN TGGRRILENG VHVTVSQMTS LLNGRTRRFA DIQLQHGALC FNIRYGTTVE 2640 
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Coding sequence: 225-2567 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 si 

I I I I I I 

CTCAGCCTTC CCGGTTCGGG AAAGGGGAAG AATGCAGGAG GGGTAGGATT TCTTTCCTGA 60 

TAGGATCGGT TGGGAAAGAC CGCAGCCTGT GTGTGTCTTT CCCTTCGACC AAGGTGTCTG 120 

TTGCTCCGTA AATAAAACGT CCCACTGCCT TCTGAGAGCG CTATAAAGGC AGCGGAAGGG 180 

TAGTCCGCGG GGCATTCCGG GCGGGGCGCG AGC AGAGAC A GGTCATGGCA GCGCCAGGCG 240 

G CTCCCCGAGT ACAGCTGCAG CTACATGGTG TCGCGGCCGG 300 

C CAGCAACAGC ACGAGCGGCG CCTGCAGGAG CGCAAGACGC 360 

TGCGGGAGAG CCTGGCCAAG TGCTGCAGTT GTTCAAGAAA GAGAGCCTTT GGTGTGCTAA 420 

AGACTCTTGT GCCCATCTTG GAGTGGCTCC CCAAATACCG AGTCAAGGAA TGGCTGCTTA 480 

GTGACGTCAT TTCGGGAGTT AGTACTGGGC TAGTGGCCAC GCTGCAAGGG ATGGCATATG 540 

CCCTACTAGC TGCAGTTCCT GTCGGATATG GTCTCTACTC TGCTTTTTTC CCTATCCTGA 600 

CATACTTTAT CTTTGGAACA TCAAGACATA TCTCAGTTGG ACCTTTTCCA GTGGTGAGTT 660 

TAATGGTGGG ATCTGTTGTT CTGAGCATGG CCCCCGACGA ACACTTTCTC GTATCCAGCA 720 

GCAATGGAAC TGTATTAAAT ACTACTATGA TAGACACTGC AGCTAGAGAT ACAGCTAGAG 780 

TCCTCATTSC CAGTGCCCTG ACTCTGCTGG TTGGAATTAT ACAGTTGATA TTTGGTGGCT 840 

G ATTCATAGTG AGGTACTTGG CAGATCCTTT GGTTGGTGGC TTCACAACAG 900 

T CCAAGTGCTG GTCTCACAGC TAAAGATTGT CCTCAATGTT TCAACCAAAA 960 

G AGTTCTCTCT ATTATCTATA CGCTGGTTGA GATTTTTCAA AATATTGGTG 1020 

ATACCAATCT TGCTGATTTC ACTGCTGGAT TGCTCACCAT TGTCGTCTGT ATGGCAGTTA 1080 

A TGATCGGTTT AGACACAAAA TCCCAGTCCC TATTCCTATA GAAGTAATTG 1140 

T TGCTACTGCC ATTTCATATG GAGCCAACCT GGAAAAAAAT TACAATGCTG 1200 

GCATTGTTAA ATCCATCCCA AGGGGGTTTT TGCCTCCTGA ACTTCCACCT GTGAGCTTGT 1260 

TCTCGGAGAT GCjTGGCTGCA TCATTTTCCA TCGCTGTGGT GGCTTATGCT ATTGCAGTGT 1320 

CAGTAGGAAA AGTATATGCC ACCAAGTATG ATTACACCAT CGATGGGAAC CAGGAATTCA 1380 

TTGCCTTTGG GATCAGCAAC ATCTTCTCAG GATTCTTCTC TTGTTTTGTG GCCACCACTG 1440 

CTCTTTCCCG CACGGCCGTC CAGGAGAGCA CTGGAGGAAA GACACAGGTT GCTGGCATCA 1500 

TCTCTGCTGC GATTGTGATG ATCGCCATTC TTGCCCTGGG GAAGCTTCTG GAACCCTTGC 1560 

AGAAGTCGGT CTTGGCAGCT GTTGTAATTG CCAACCTGAA AGGGATGTTT ATGCAGCTGT 1620 

1680 



GCACAGATAT CTACAAAAGT ACCAAGAATT ACAAAAACAT TGAAGAACCT CAAGGAGTGA 1860 

AGATTCTTAG ATTTTCCAGT CCTATTTTCT ATGGCAATGT CGATGGTTTT AAAAAATGTA 1920 

TCAAGTCCAO AGTTGGATTT GATGCCATTA GAGTATATAA TAAGAGGCTG AAAGCGCTGA 1980 

GGAAAATACA GAAACTAATA. AAAAGTGGAC AATTAAGAGC AACAAAGAAT GGCATCATAA 2040 

GTGATGCTGT TTCAACAAAT AATGCTTTTG AGCCIGATGA GGATATTGAA GATCTGGAGG 2100 

AACTTGATAT CCCAACCAAG GAAATAGAGA TTCAAGTGGA TTGGAACTCT GAGCTTCCAG 2160 

TCAAAGTGAA CGTTCCCAAA GTGCCAATCC ATAGCCTTGT GCTTGACTGT GGAGCTATAT 2220 

CTTTCCTGGA CGTTGOTGGA GTGAGATCAC TGCGGGTGAT TGTCAAAGAA CTCCAAAGAA 2280 

TTGATGTGAA TGTGTATTTT GCATCACTTC AAGATTATGT GATAGAAAAG CTGGAGCAAT 2340 

GCGGGTTCTT TGACGACAAC ATTAGAAAGG ACACATTCTT TTTGACGGTC CATGATGCTA 2400 

TACTCTATCT ACAGAACCAA GTGAAATCTC AAGAGGGTCA AGGTTCCATT TTAGAAACGA 2460 

TCACTCTCAT TCAGGATTGT AAAGATACCC TTGAATTAAT AGAAACAGAG CTGACGGAAG 2520 

AAGAACTTGA TGTCCAGGAT GAGGCTATGC GTACACTTGC ATCCTGAAAG TGGGTTCGGG 2580 

AGGTCTCTAT GAGCAAGGAA TACAAGACAA AACTTCCTCA ATGCATTGAC TATTTCTTCA 2640 

GACTCAAAAC ACTCATTCTT TTTTCTATTA AGCCATTGAA AGAGAAGCAC TAAGACTGCT 2700 

TCTAGGCTTT ATTTATAAAA TAAACACCTT ATCCCTAACA TGGGCAAAAT GGCTAGAATT 2760 

ATTCAGACGA TTTGGCAGCG TCCAGGGTAA GCTGGTGTTA TAATACGCTG CTGATCTACA 2820 

TCACAGATTT GCTAATAATG TTCACGTGGG CCCTGGCATA TCTCTCTTCA GTTAGAGTGA 2880 

GTGCTGACCC AACAGCCTCT GTGGTCAAGC GAGTCACGAA TGATTAATCA TAAAGAAAAA 2940 

TCAGTTTTTG ACTGACCTGG ATATCCATGA GCTGCACTGA TCACCATGTA AGGTCACATT 3000 

TAGTAAATGC TGAAATAAAA TGATTAATGC ATTTATCAAT AAAAGCCTTT GAAAATACTT 3060 
TGGATAATAA ATTGGAGTTT TAAAAATGCA AATTTGCTTA GTATCTAATA A 
ATTACATATA GCCGGAATTG AGGATCTCTT TGATCCTGGA AATGGTTTAC C 

CAGAACCAGG CCAATATATT TTGAAATATT GATGCAGACA AATGAAATAA TAAAGAGATT 3240 

TTCATGGTTT ATAAAAATCT TTTTTGATAT GATAATAATC ATGATCACAA CTGAGATCAA 3300 

AAAAATATAT GACAGATTAT TTTGTTTAAA AATGCAGTTT TAATTATCTT AGTCTATAGA 33 60 

AATGATCATT GCATGGAGGC ATGTATAGGT ATGATCTGTG TAAAATCTGA CATAAAAACA 3420 

GTGCTATTCT GAGTGAAAAT TTTTTTGATG TGCTTACATA ACCATGGTGA TTAAAATGAG 3480 

TTTATATTTT TTCTCAAAAA TTTTAGCAGT GTGTAAAGTA AGTAATCTTT AACTGAACTC 3540 

TGACCACTTA AAAAAAAATC TAAAAATTGA ACTACCTATA GTAGTCTGTG TTTAAAGTGA 3600 

ATTTTTAAAG ACAAAGCATT CTAAATGAAC TCAATATAAA AACATTCATT TGGAATGTAC 3660 

ATACTGAAAA ATACAGGTTT TTTTGACCAA AAGTTTTTAT ATCTTTTCTT TTTATTTATT 3720 

TTTTTCCTAA GTGCCAACAA TTTTCTAGAT ATTATATACA ACACAGGCTT TGATCTTGGG 3780 

GACTTTTCCC ATATATTTCA CACTGGAGTG AATCAAGTTG TACTTCATTT CTAGAGAAAA 3840 

GTTATACCCA GGTCCCCAAT TGAGAATGTC TTGCTTGATT GAAAACGACA TCATCCGTTG 3900 

GTATACTCCA GGGATTGGTT TCAGGACCCC TGCATTTACC AAAATTTGTG CACACTCAAG 3960 

TCCTGCAGTC ACCCCTGCCT AAAGATAGAA TGGCTTCTCT GTTTTTCTTC TGAAATACAA 4020 

CCAGAAACAA TGTGTCTATT TCTGAAAGAA TAGGATTAAT GATCATACAA ATGGGTTAAT 4080 

CCTGAATTCT GGTTGTAAAT CTGGTTACAG CATAACTAGG ATTATAATGC TGCCTCATTT 4140 

TCACAGCACT ACTTGCTTAT ATTGACAACA AATCATCTCG CTAAAGAGTG AATGTAGGCC 4200 

AGGCGCGGTG GCTCATGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG GGTGGATCAC 4260 

GAGGTCAGGA GATCGAGACC ATCCTGGCTA ACATGGTAAA ACCCCGTCTC TACTAAAAAT 4320 

AGAAAAAAAG AAATTAGCCT AGCGTGGTGG CTGGCGGGCG CCTGTAGTCC CAGCTATTTG 4380 

GGAGGCTAAG GCAGGAGAAT GGCGTGAACC CGGGAGGCGG AGCTTGCAGT GAGCCGAGGT 4440 

CGTGCCACTG CACTCCAGCC TGGGCGACAG AGCAAGACTC CGTCTCAAAA AAAAAAAAAA 4500 
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AAAAAAAAAA AGAGTGAATG TAATAGTCTT GCAGAAAATG AATGAATACC TTTGTTCAAT 
AAAGGAAATA TGCACTGCTC ACTTTTTTGA AGGAAATGCC AAAGTTACGT TTTACAACAA 
GGCTAGAGTT TGTAAATTCT GGGTTCATTT GTGATGACAT AAGTCAGCAA ACTGCGGGAA 
TACTGItTCT TCTATGTATT TTGTGAATAG TAAGCATAAT TTTAGTTTTG TATTATCAAT 
GAAAATTTCA CTTGAAATTA AAGCTGCCTT TTGTTATATT TTTAACCTAT AGGATAAGAT 
TCCAGTATTG TATATGAGTT TTAACAAATT AAAAAATCAA ATCATGTACA TTTGAAAATA 
TTTCCACACA TTTAAAAATA AATGTAAAGT TGTCTTTTAA ACTACTCGGA TGTGTCCTTT 
CTGAACAAAA 



3 ALTLLVGIIQ LIFGGLQIGF IVRYLADPLV GGFTTAAAFQ VLVSQLKIVL 240 

lYTLVEI FQNIGDTNIA DFTAGLLTIV VCMAVKEUJD RFRHKIPVPI 300 

PIEVIVTIIA TAISYGANLE KNYNAGIVKS IPRGFLPPEL PPVSLFSEML AASFSIAWA 360 

YAIAVSVGKV YATKYDYTID GNQEFIAFGI SNIFSGFFSC FVATTALSRT AVQESTGGKT 420 

QVAGIISAAI VMIAILALGK LLEPLQKSVL AAWIANLKG MFMQLCDIPR LWRQNKIDAV 480 

IWVFTCIVSI ILGLDLGLLA GLIFGLLTW LRVQFFSWNG LGSIPSTDIY KSTKHYKNIE 540 

EPQGVKItiRF SSPIFYGNVD GFKKCIKSTV GFDAIRVYHK RLKALRKIQK LIKSGQLRAT 600 

KNGIISDAVS TNNAFEPDED IEDLEELDIP TKEIEIQVDW NSELPVKUW PKVPIHSLVL 660 

DCGAISFLDV VGVRSLRVIV KEFORIDVNV YFASLQDYVI EKLEQCGFFD DNIRKDTFFL 72 0 

TVHDAILYLQ NQVKSQEGQG SILETITLIQ DCKDTLELIE TELTEEELDV QDEAMRTLAS 780 
QDEAMRTLAS 

SEQ ID NO:233 PFH2 DNA SEQUENCE: 

Nucleic Acid Accession*: NMJJ16029 

Coding sequence: 223-1 097 (underlined sequences correspond to start and stop codons) 



| GCAGGGCAGC GACGCGACTC TGGTGCGGGC CGTCTTCTTC CCC 

GGCCGCAATG AACTGGGAGC TGCTGCTGTG GCTGCTGGTG CTGTGCGCGC 120 

CTTGGTGCAG CTGCTGCGCT TCCTGAGGGC TGACGGCGAC CTGACGCTAC 180 

TATGGGCCGA GTGGCAGGGA CGACGCCCAG AATGGGAGCT GACTGATATG GTGGTGTGGG 240 

TGACTGGAGC CTCGAGTGGA ATTGGTGAGG AGOTGGCTTA CCAGTTGTCT AAACTAGGAG 300 

TTTCTCTTGT GCTGTCAGCC AGAAGAGTGC ATGAGCTGGA AAGGGTGAAA AGAAGATGCC 360 

TAGAGAATGG CAATTTAAAA GAAAAAGATA TACTTGTTTT GCCCCTTGAC CTGACCGACA 420 

CTGGTTCCCA TGAAGCGGCT ACCAAAGCTG TTCTCCAGGA GTTTGGTAGA ATCGACATTC 480 

TGGTCAACAA TGGTGGAATG TCCCAGCGTT CTCTGTCCAT GGATACCAGC TTGGATGTCT 540 

ACAGAAAGCT AATAGAGCTT AACTACTTAG GGACGGTGTC CTTGACAAAA TGTGTTCTGC 600 
CTCACATGAT CGAGAGGAAG CAAGGAAAGA TTGTTAC 
TATCTGTACC TCTTTCCATT GGATACTGTG CTAGCAAGCA T 

ATGGCCTTCG AACAGAACTT GCCACATACC CAGGTATAAT AGTTTCTAAC ATTTGCCCAG 
GACCTGTGCA ATCAAATATT GTGGAGAATT CCCTAGCTGG A 
GCAATAATGG AGACCAGTCC CACAAGATGA CAACCAGTCG T 

TCAGCATGGC CAATGATTTG AAAGAAGTTT GGATCTCAGA ACAACCTTTC TTGTTAGTAA 960 

CATATTTGTG GCAATACATG CCAACCTGGG CCTGGTGGAT AACCAACAAG ATGGGGAAGA 1020 

AAAGGATTGA GAACTTTAAG AGTGGTGTGG ATGCAGACTC TTCTTATTTT AAAATCTTTA 1080 

AGACAAAACA TGACTGAAAA GAGCACCTGT ACTTTTCAAG CCACTGGAGG GAGAAATGGA 1140 

AAACATGAAA ACAGCAATCT TCTTATGCTT CTGAATAATC AAAGACTAAT TTGTGATTTT 12 00 

ACTTTTTAAT AGATATGACT TTGCTTCCAA CATGGAATGA AATAAAAAAT AAATAATAAA 1260 
AGATTGCCAT GAATCTTGCA AA 



45 
50 
55 
60 

65 SEQ ID N0:234PFH2 Protein sequence: 



__ 1 11 21 31 41 51 

70 | I I III 

MNWEM,LWLL VLCALLLLIiV QLLRFLRADG DLTLtWAEWQ GRRPEWELTD MVW1VTGASS 
GIGEELAYQL SKLGVSLVLS AKRVHELERV KRRCLEHGNL KEKDILVLPL DLTDTGSHEA 
ATKAVLQEFG RIDILVNNGG MSQRSLCMDT SLDVYRKLTE LNYLGTVSLT KCVLPHMIER 
_ KQGKIVTVNS 1LGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGFVQSN • 
75 IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY : 

MPTWAWWITH KHGKKRIENF KSGVDADSSY FKIFKTKHD 

SEQ ID N0:235ACC5DNA SEQUENCE 

80 Nucleic Acid Accession*: NM_000450 
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1-1833 (underlined sequences correspond to start and stop cottons) 



ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TTCTCATTAA AGAGAGTGGA 
GCCTGGTCTT ACAACACCTC CACGGAAGCT ATGACTTATG ATGAGGCCAG TGCTTATTGT 
CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 



TGGGTCTGGG TAGGAACCCA GAAACCTCTG A 

GAACCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAGA TCTACATCAA G 
GATGTGGGCA TGTGGAATGA TGAGAGGTGC A 

GCTCCCTGTA CCAATACATC CTGCAGTGGC CACGGTGAAT GTGTAGAGAC CATCAATAAT 480 

TACACTTGCA AGTGTGACCC TGGCTTCAGT GGACTCAAGT GTGAGCAAAT TGTGAACTGT 540 

ACAGCCCTGG AATCCCCTGA GCATGGAAGC CTGGTTTGCA GTCACCCACT GGGAAACTTC 600 

AGCTACAATT CTTCCTGCTC TATCAGCTGT GATAGGGGTT ACCTGCCAAG CAGCATGGAG 660 

ACCATGCAGT GTATGTCCTC TGGAGAATGG AGTGCTCCTA TTCCAGCCTG CAATGTGGTT 720 

GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTCGTGG AATGTTTCCA AAACCCTGGA 780 

AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTCA ACTAATGGGA 840 

GCCCAGAGCC TTCAGTGTAC CTCATCTGGG AATTGGGACA ACGAGAAGCC AACGTGTAAA 900 

GCTGTGACAT GCAGGGCCGT CCGCCAGCCT CAGAATGGCT CTGTGAGGTG CAGCCATTCC 960 

C TGCAACTTCA CCTGTCAGGA AGGCTTCATG 1020 

C ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

G CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 

CTTCCTAGTG GTTCTGGCAG TTTCCGTTAT GGGTCCAGCT GTGAGTTCTC CTGTGAGCAG 1200 

GGTTTTGTGT TGAAGGGATC CAAAAGGCTC CAATGTGGCC CCACAGGGGA GTGGGACAAC 1260 

GAGAAGCCCA CATGTGAAGC TGTGAGATGC GATGCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 

GTGAGGTCTG CTCATTCCCC TATTGGAGAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 

TGTGAGGAGG GATTTGAATT ATATGGATCA ACTCAACTTG AGTGCACATC TCAGGGACAA 1440 

TGGACAGAAG AGGTTCCTTC CTGCCAAGTG GTAAAATGTT CAAGCCTGGC AGTTCCGGGA 1500 

AAGATCAACA TGAGCTGCAG TGGGGAGCCC GTGTTTGGCA CTGTGTGCAA GTTCGCCTGT 1560 

CCTGAAGGAT GGACGCTCAA TGGCTCTGCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620 

TCTGGCCTGC TACCTACCTG TGAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 

TGCTTACGGA AAGCAAAGAA ATTTGTTCCT GCCAGCAGCT GCCAAAGCCT TGAATCAGAC 1800 



SEQJD NQ:236 ACp5 Prp|ein sequence: 



I I I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 
SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPHNRQKDED CVEIY1KREK 
DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 
TALESPEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNW 
ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NVTONEKPTCK 
AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 
VCEAF OCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 
EKPTCEAVHC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 
3LAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 



3HIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 



SEQ ID NO:237 PM28 DNA SEQUENCE 

lion #: N51002 

Coding sequence: 1-3793 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGATGTGTG AAGTGATGCC CACGATTAAT GAGGACACCC CAATGAGCCA AAGGGGGTCC 
CAAAGCAGTC GCTCGGACTC AGACTCCCAT TTTGAGCAGC TGATGGTCAA TATGCTAGAT 
GAAAGGGATC GTCTTCTAGA CACCCTTCGG GAGACCCAGG AAAGCCTCTC ACTTGCCCAG 
CAAAGACTTC AGGATGTCAT CTATGACCGA GACTCACTCC AGAGACAGCT CAATTCAGCC 
CTGCCACAGG ATATCGAATC CCTAACAGGA GGGCTGGCTG GTTCTAAGGG GGCTGATCCA 
CCGGAATTTG CTGCACTGAC AAAAGAATTA AATGCCTGCA GGGAACAACT TCTAGAAAAG 
GAAGAAGAAA TCTCTGAACT TAAAGCTGAA AGAAACAACA CAAGACTATT ACTGGAGCAT 
TTGGAGTGCC TTGTGTCACG ACATGAAAGA TCACTAAGAA TGACGGTGGT AAAACGGCAA 
GCCCAGTCTC CCTCAGGAGT ATCCAGTGAA GTTGAAGTTC TCAAGGCACT GAAATCTTTG ! 
TTTGAGCACC ACAAGGCCTT GGATGAAAAG GTAAGGGAGC GACTGAGGGT TTCTTTAGAA I 
AGAGTCTCTG CACTGGAAGA AGAACTAGCT GCTGCTAATC AGGAGATTGT TGCCTTGCGT I 
GAACAAAATG TTCATATACA AAGAAAAATG GCATCAAGCG AGGGATC 
CATCTTGAAG GGATGGAACC TGGACAGAAA GTCCATGAGA AGCGTTTGTC C 
ATAGACTCAA CCGATGAAAC TAGTCAAATA GTTGAACTAC AAGAATTGCT T 
AACTATGAAA TGGCCCAGAT GAAAGAACGT TTAGCAGCCC TTTCTTCCCG A 
GTGGAACAGG AAGCAGAGAC AGCAAGAAAG GATCTCATTA AAACAGAAGA AATGAACACC 960 
AAGTATCAAA GGGACATTAG GGAGGCCATG GCACAAAAGG AAGATATGGA AGAAAGAATT 1020 
ACAACCCTTG AAAAGCGTTA CCTCAGTGCT CAGAGAGAAT CTACCTCCAT ACATGACATG 1080 
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AATGATAAAC TAGAAAATGA GTTAGCAAAT AAAGAAGCTA TCCTACGGCA GATGGAAGAG 1140 

AAAAACAGAC AGTTACAAGA ACGTCTTGAG CTAGCTGAAC AAAAGTTGCA GCAGACCATG 1200 

AGAAAGGCTG AAACCTTGCC TGAAGTAGAG GCTGAACTGG CICAGAGAAT TGCAGCCCTA 1260 

ACCAAGGCTG AAGAGAGACA TGGAAATATT GAAGAACGTA TGAGACATTT AGAGGGTCAA 1320 

CTTGAAGAGA AGAATCAAGA ACTTCAAAGA GCTAGGCAAA GAGAGAAAAT GAATGAGGAG 1380 

CATAACAAGA GATTATCGGA TACGGTTGAT AGACTTCTGA CTGAATCCAA TGAACGCCTA 1440 

CAACTACACT TAAAGGAAAG AATGGCTGCT CTAGAAGAAA AGAATGTTTT AATTCAAGAA 1500 

TCAGAAACTT TCAGAAAGAA TCTTGAAGAA TCTTTACATG ATAAGGAAAG ATTAGCAGAA 1560 

GAAATTGAAA AGCTGAGATC TGAACTTGAC CAATTGAAAA TGAGAACTGG CTCTTTAATT 1S20 

GAACCCACAA TACCAAGAAC TCATCTAGAC ACCTCAGCTG AGTTGCGGTA CTCAGTGGGA 1680 

TCCCTAGTGG ACAGCCAGTC TGATTACAGA ACAACTAAAG TAATAAGAAG ACCAAGGAGA 1740 

GGCCGCATGG GTGTGCGAAG AGATGAGCCA AAGGTGAAAT CTOTTGGGGA TCACGAGTGG 1S00 

AATAGAACTC AACAGATTGG AGTACTAAGC AGCCACCCTT TTGAAAGTGA CACTGAAATG 1860 

TCTGATATTG ATGATGATGA CAGAGAAACA ATTTTTAGCT CAATGGATCT TCTCTCTCCA 1920 

AGTGGTCATT CCGATGCCCA GACGCTAGCC ATGATGCTTC AGGAACAATT GGATGCCATC 1980 

AACAAAGAAA TCAGGCTAAT TCAGGAAGAA AAAGAATCTA CAGAGTTGCG TGCTGAAGAA 2040 

ATTGAAAATA GAGTGGCTAG TGTGAGCCTC GAAGGCCTGA ATTTGGCAAG GGTCCACCCA 2100 

GGTACCTCCA TTACTGCCTC TGTTACAGCT TCATCGCTGG CCAGTTCATC TCCCCCCAGT 2160 
A CTCCAAAGCT CACCCCTCGA AGCCCTGCCA G 



\ TCTGAGGAAA CATCGGAGAA AGATTGCAGT TGTGGAAGAA 2280 

G AGGACAAAGC AACAATTAAA TGTGAAACTT CTCCTCCTCC TACCCCTAGA 2340 

GCCCTCAGAA TGACTCACAC TCTCCCTTCT TCCTACCACA ATGATGCTCG AAGTAGTTTA 2400 

TCTGTCTCTC TTGAGCCAGA AAGCCTCGGG CTTGGTAGTG CCAACAGCAG CCAAGACTCT 2460 

CTTCACAAAG CCCCCAAGAA GAAAGGAATC AAGTCTTCAA TAGGACGTTT GTTTGGTAAA 2520 



CCAACTGTGG TCGCATGGCT AGAGCTTTGG TTGGGAATGC CTGCGTGGTA CGTGGCAGCC 



T GGGCCCAGTG TCCGGTTTTT CTACAGACCC T 
CATGAGTGGA TTGGAAATGA ATG5CTTCCC AGCTTGGGGT TACCTCAGTA C 
TTTATGGAAT GCTTGGTAGA TGCAAGAATG TTAGATCACC TAACAAAAAA A 

GTCCATTTAA AAATGGTGGA TAGTTTCCAT CGAACAAGTT TACAATATGG AATTATGTGC 3240 

TTAAAGAGGT TGAATTATGA CAGAAAAGAA CTAGAAAGAA GACGGGAAGC AAGCCAACAT 3300 

GAAATAAAAG ACGTGTTGGT GTGGAGCAAT GACCGAATTA TTCGCTGGAT ACAAGCAATT 33 60 

GGACTTCGAG AATATGCAAA TAATATACTT GAGAGCGGTG TGCATGGCTC ACTTATAGCC 3420 

ACTTTGACTA CAGCAGCTTA ACTTTATTAT TACAGATTCC AACACAGAAC 3480 

GGCAGATTCT TGAAAGAGAA TACAATAACC TCTTGGCCCT GGGAACTGAA 3540 

A TGACAAGAAC TTCAGACGTG GATCAACCTG GAGAAGGCAG 3600 

A TGGAATCAGC ATGATGCCTG GGTCCTCAGA AACATTACCA 3660 

C AACCTCTGGG CAATCAAGAA AAATGACAAC AGATGTTGCT 3720 

T AGACAACTCC ACTGTTCGCA CATACTCATG TCTCGAGTAA 3780 




1 11 21 31 41 51 

I I I I I I 

MMCEVMPTIN EDTPMSQRGS QSSGSDSDSH FEQLMWJMLD ERDRLLDTLR ETQESLSLAQ 60 

QRLQDVIYDR DSLQRQLNSA LPQDIESLTG GLAGSKGADP PEFAALTKEL NACREQLLEK 120 

EEEISELKAE RNNTRLLLEH LECLVSRHER SLRMTWKRQ AQSPSGVSSE VEVLKALKSL 180 

FEKHKALDEK VRERLRVSLE RVSALEEELA AANQEIVALR EQNVHIQRKM ASSEGSTESE 240 

HLEGMEPGQK VHEKRLSNGS IDSTDETSQI VELQELLEKQ HYEMAQMKER LAALSSRVGE 300 

VEQEAETARK DLIKTEEMNT KYQRDIRBAM AQKEDMEERI TTLEKRYLSA QRESTSIHDM 360 

NDKLENELAN KEAILRQMEE KNRQLQERLE LAEQKLQQTM RKAETLEEVE AELAQRIAAB 420 

TKAEERHGMI EERMRHLEGQ LEEKNQELQR ARQEEKMNEE HNKRLSDTVD RLLTESNERL 480 

QLHLKERMAA LEEKNVLIQE SETPRKNLEE SLHDKERLAE EIEKLRSELD QLKMRTGSI.I 540 

EPTIPRTHLD TSAELRYSVG SLVDSQSDYR TTKVIRRPRR GRMGVRRDEP KVKSLGDHEW 600 

NRTQQIGVLS SHPPESDTEM SDIDDDDRET IFSSMDLLSP SGHSDAQTLA MMLQEQLDAI 660 

NKEIRLIQEE KESTELRAEE IENRVASVSL EGLNLARVHP GTSITASVTA SSLASSSPPS 720 

GHSTPKLTPR SPAREKDRMG VMTLPSDLRK HRRKIAWEE DGREDKATIK CETSPPPTPR 780 

ALRMTHTLPS SYHNDARSSL SVSLEPESLG LGSANSSQDS LHKAPKKKGI KSSIGRLFGK 840 

KEKARLGQLR GFKETEAAAQ ESLGLGKLGT QAEKDRRLKK KHELLEEARR KGLPFAQWDG 900 

PTWAWLELW LGMPAWYVAA CRANVKSGAI MSALSDTEIQ REIGISNPLII RLKLRLAIQE 960 

HVSLTSPSAP PTSRTPSGNV WVTHEEMEHL AAPAKTKESE EGSWAQCPVF LQTLAYGDMN 1020 

HEWIGNEWLP SLGLPQYRSY FMECLVDARM LDHLTKKDLR VHLKMVDSFH RTSLQYGIMC 1080 

LKRLNYDRKE LERRREASQH EIKDVLVWSN DRIIRWIQAI GLREYANNIL ESGVHGSLIA 1140 

LDENFDYSSL TLLLQIPTQN TQARQILERE YNNLLALGTE RRLDESDDKN FRRGSTWRRQ 1200 
FPPREVHGIS MMPGSSETLP AGFRLTTTSG QSRKMTTDVA SSRLQRLDNS TVRTYSCLE 

Nucleic Acid Accession*; 
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1 11 21 31 41 51 

I I I I I I 

ATGAGGCGAC TGAATCGGAA AAAAACTTTA AGTTTGGTAA AAGAGTTGGA TGCCTTTCCG 60 

AAGGTTCCTG AGAGCTATGT AGAGACTTCA GCCAGTGGAG GTACAGTTTC TCTAATAGCA 120 

TTTACAACTA TGGCTTTATT AACCATAATG GAATTCTOAG TATATCAAGA TACATGGATG 180 

AAGTATGAAT ACGAAGTAGA CAAGGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTTGCCA TGAAGTGTCA ATATGTTGGA GCGGATGTAT TGGATTTAGC AGAAACAATG 300 

GTTGCATCTG CAGATGGTTT AGTTTATGAA CCAACAGTAT TTGATCTTTC ACCACAGCAG 360 

AAAGAGTGGC AGAGGATGCT GCAGCTGATT CAGAGTAGGC TACAAGAAGA GCATTCACTT 420 

CAAGATGTCA TATTTAAAAG TGCTTTTAAA AGTACATCAA CAGCTCTTCC ACCAAGAGAA 480 

GATGATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TGTCAATAAA 540 

GTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCG TGGTCATGCA 600 

CATTTGGCAG CACTTGTCAA CCATGAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 660 

TCTTTTGGAG AGCTTGTTCC AGCAATTATT AATCCTTTAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAGCAGACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 840 

CATGCTGCAG GCAGCCATGG AGTCTCTGGG ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATTGTTGGAG GAATCTTTTC AACAACAGGC ATGTTACATG GAATTGGAAA ATTTATAG1T 1020 

GAAATAATTT GCTGTCGTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAGGATGGCC ACACAGACAA CCACTTACCT CTTTTAGAAA ATAATACACA TTGA 



:ELDAFP KVPESYVETS ASGGTVSLIA FTTMALLTIM EFSVYQDTWM 
KYEYETOKDF SSKLRMIDI TVAMKCQYVG ADVLDLAETM VASADGLVYE PTVFDLSPQQ 
KEWQRMLQLI QSRLQEEHSL QDVIFKSAFK STSTALPPRE DDSSQSPNAC RIHGHLYVMK 
VAGNFHITVG KAIPHPRGHA HLAALVNHES YNFSHRIDHL SFGELVPAII NPLDGTEKIA 
IDHNQMFQYF I 



SEQ ID NO:241 PBA7 DNA SEQUENCE 

Nucleic Acid Accession*: AA219134 

Coding sequence: 24-1 815 (undeiiined sequences con-espond to start and stop codons) 



AATTCGCCCT TGCTTAATTA AGCATQTTTA CCTTCCTGTC ATCTGTCACT GCTGCTGTCA 60 
GTGGCCTCCT GGTGGGTTAT GAACTTGGGA TCATCTCTGG GGCTCTTCTT CAGATCAAAA 120 
CCTTATTAQC CCTGAGCTGC CATGAGCAGG AAATGGTTGT GAGCTCCCTC GTCATTGGAG 1 80 
CCCTCCTTGC CTCACTCACC GGAGGGGTCC TGATAGACAG ATATGGAAGA AGGACAGCAA 240 
TCATCTTGTC ATCCTGCCTG CTTGGACTCG GAAGCTTAGT CTTGATCCTC AGTTTATCCT 300 
ACACGGTTCT TATAGTGGGA CGCATTGCCA TAGGGGTTTC CATCTCCCTC TCTTCCATTG 360 
CCACTTGTGT TTACATCGCA GAGATTGCTC CTCAACACAG AAGAGGCCTT CTTGTGTCAC 420 
TGAATGAGCT GATGATTGTC ATCGGCATTC TTTCTGCCTA TATTTCAAAT TACGCATTTG 480 
CCAATGTTTT CCATGGCTGG AAGTACATGT TTGGTCTTGT GATTCCCTTG GGAGTTTTGC 540 
AAGCAATTGC AATGTATTTT CTTCCTCCAA GCCCTCGGTT TCTGGTGATG AAAGGACAAG 600 
AGGGAGCTGC TAGCAAGGTT CTTGGAAGGT TAAGAGCACT CTCAGATACA ACTGAGGAAC 660 
TCACTGTGAT CAAATCCTCC CTGAAAGATG A ATATCAGTA CAGTTTTTGG G ATCTGTTTC 720 
GTTCAAAAGA CAACATGCGG ACCCGAATAA TGATAGGACT AACACTAGTA TTTTTTGTAC 780 
AAATCACTGG CCAACCAAAC ATATTGTTCT ATGCATCAAC TGTTTTGAAG TCAGTTGGAT 840 
TTCAAAGCAA TGAGGCAGCT AGCCTCGCCT CCACTGGGGT TGGAGTCGTC AAGGTCATTA 900 
GCACCATCCC TGCCACTCTT CTTGTAGACC ATGTCGGCAG CAAAACATTC CTCTGCATTG 960 
GCTCCTCTGT GATGGCAGCT TCGTTGGTGA CCATGGGCAT CGTAAATCTC AACATCCACA 1020 
TGAACTTCAC CCATATCTGC AGAAGCCACA ATTCTATCAA CCAGTCCTTG GATGAGTCTG 1080 
TGATTTATGG ACCAGGAAAC CTGTCAACCA AC AAC AATAC TCTCAG AGAC CACTTCAAAG 1 140 
GGATTTCTTC CCATAGCAGA AGCTCACTCA TGCCCCTGAG AAATGATGTG GATAAGAGAG 1200 
GGGAGACGAC CTCAGCATCC TTGCTAAATG CTGGATTAAG CCACACTGAA TACCAGATAG 1260 
TCACAGACCC TGGGGACGTC CCAGCTTTTT TGAAATGGCT GTCCTTAGCC AGCTTGCTTG 1320 
TTTATGTTGC TGCTTTTTCA ATTGGTCTAG GACCAATGCC CTGGCTGGTG CTCAGCGAGA 1380 
TCTTTCCTGG TGGGATCAGA GGACGAGCCA TGGCTTTAAC TTCTAGCATG AACTGGGGCA 1440 
TCAATCTCCT CATCTCGCTG ACATTTTTGA CTGTAACTGA TCTTATTGGC CTGCCATGGG 1500 
TGTGCTTTAT ATATACAATC ATGAGTCTAG ATCTTATTGG CCTGCCATGG GTGTGCTTTA 1560 
TATATACAAT CATGAGTCTA GCATCCCTGC TTTTTGTTGT TATGTTTATA CCTGAGACAA 1620 
AGGGATGCTC TTTGGAACAA ATATCAATGG AGCTAGCAAA AGTGAACTAT GTGAAAAACA 1680 
ACATTTGTTT TATGAGTCAT CACCAAGAAG AATTAGTGCC AAAACAGCCT CAAAAAAGAA 1740 
AACCCCAGGA GCAGCTCTTG GAGTGTAACA AGCTGTGTGG TAGGGGCCAA TCCAGGCAGC 1800 
TTTCTCCAGA GACCTAATGG CCTCAACACC TTCTGAACGT GGATAGTGCC AGAACACTTA 1860 
GGAGGGTGTC TTTGGACCAA TGC ATAGTTG CGACTCCTGT GCTCTCTTTT CAGTGTCATG 1 920 
GAACTGGTTT TGA AGAG ACA CTCTGAAATG ATAAAGACAG CCTTTAATCC CCCTCCTCMC 1980 
CAGAAGGAAC CTCAAAAGGT AGATGAGGTA CAAGGTCCTA AGTGATCTCT TTTTCTGAGC 2040 
AGGATATCAG GTTAAAAAAA AAAAGTTACT GGCTGGTTTA ATACTTTCTA CCTTCTTCAC 2100 
AGAGCAGCCT TTGAATAGAC TATGTCCTAG TGAAGACATC AACCTCCGCC TTAAGCTATG 2160 
TATGTATGGA GGCCAGTCGC AGCTTTATTA TGCAGACACA CAAGTGGTCT GGACATGAGG 2220 
GTACAGTTTC TGCCTACCAA GACACTACTT GCACTGGATC TTACGCAAAA AAGAACCAGA 2280 
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ACACACAGTG TCGACAACTG CCCATATATT CTATCTAGAT TAGGAGAOGG TCCTGGCTAG 2340 
GATTTTAGTG GTAATTCCTA GTTACATTCA ACAAGTATAA AGATTATAGA GCTTATTTTA 2400 
TGAACTATAA ACTATAATTT AATGCAAA AT ATCCTTTTAT GAATTTCATG TTAATATTGT 2460 
GAAATATTAA AATAATTCCR CAATAGTTGA GAAAAATGAG CATTTTTTTC CATTTTTAAA 2320 
AAATGCATAG AAAAGACAAT TTTAAA ATCC TGGGACCATA TTTATTTAGA AGTAGCTGTT 2580 
AGTAAAACAT TAGAAAAGGA GTCAGGCCAT TAGGTTATTT ATCCAAATCT CI 
AGGTTGAAGT TATTAAGTCA AGCCTAGAAA AGCTGCCTCC TTGTAAGGCT Tl 
TGTATAGTAA TCCACAGTGT CCAATTCTTC ACACTCCTCA GGAATATCAC TACCTCAGGT 2760 
TACGGTACAC AGGCTATAAT TGATGATGAT GTTCAGATAA CTGAAGACAC AATAAATGAC 2820 
ATTCAGACAT CAGGAMAAWW CCCTCATGTT CTTTTCTATG ATGGCCACCT GTACCAGCAA 2880 
CGTGGGTTTC ACCCACACAA CGATGAACTG TTCTCTTACT TCTCCAGTTG ATTTTAAAGA 2940 
CTTGTTAAGA GGTCTTACTA ATAAAATTTG GGTATGATAG AAAAWCCACA ATCAAAWCTT 3000 
GAACCAAATA ACATATTAAA TTACTAATAT TTAAGTGATG GAAGACACAC AAAAAACTTA 3060 
AAAGCACGAA CAACCTAACT TGAAAAAGAA TTTTAAAATA TGATTAACCT GAAGAAAAGA 3120 
GAATCCTAAG AGCCAAAGCT CCTTTTTATT TAGCTTGGAA TTTTCCTATT GGTTCCTAAC 3180 
AAACTGTCCC AATGTCATAT AAGGAAACAT GATCTATTAC AITCCTTTAT AACAATGTGG 3240 
AGAGACTATA AACCTATGTA AGTAGTAAAA CTATATYAGA GACTCAGGAG ACTGACTAAA 3300 
AGGCCTGGAT CTGCAGTGTA TTATCTGTAT AAAAATTGGC AGGGGGAAGC TAAAAGGAAA 3360 
GGAGATTGGA GATCTCAATT CTATCATGGT GTATTTCATA CGCAAATCAG AGCATGCATT 3420 
GTTTTTTGTT TTTGGAAAGA GAAGGGAAGT GTGTTCTGCC CCATGTTTCC TTCCGTGTTT 3480 
ATAGTTCAAA CTCTATATAT ACTTCAGGTA TTTTTTGTTT AGCCCTTCAT TATAAATGGG 3540 
CAGGAAATTG TTTATCAACC TAGCCAGTTT ATTACTAGTG ACCTTGACTT CAGTATCTTG 3600 
AGCATTCTTT TATATTTTTC nTTATTATC CTGAGTCTGT AACTAAACAA TTTTGTCTTC 3660 
AAATTTTTAT CCAATATCCA TTGCACCACA CCAAATCAAG CTTCTTGATT TTCAAAAATA 3720 
AAAAGGGGG A AATACTTACA ACTTGTACAT ATATATTCAC AGTTTTTATT TATAAAAAAA 3780 
ATTTACAGTA CTTATGGAGA GCCAGCAGAA GACATCAGAG CACTCACTTC TTCCCATCTT 3840 
TGTTAAGGTT AGCGAATTAC CCATGGACAC TGTTAGGTGA GGCTCATTCG GCAGCCCTGA 3900 
AAACAAACCT GGTCACACTG TCTTTACCCT CTCCCTTCAG ATAAAGCACT TCGATTATCT 3960 
ATTGATCTGC CCAGTTTTCA AGTCATGCGA ATACTAAAAA GGTTACATCA TCTGGATCTG 4020 
TACCTTGGCT ATATAAGCAT GTTrTCCCCC TATTCTATGT TTCTTTTTTT GGTGAACATT 4080 
GAAAAACAGG AGGTGACTTA TTACTGTTAA TTAAAACTAA ATGAAAAATG TCAAGTCTTT 4140 
AAAACAGTGA GCTTGTAACT CTTTCATGTA ATTTTATTCT CTATGAATTT GGCTATCCTA 4200 
CTGAATCTTA AAATAAAGGA AATAAACACT TTTTTTTWAA AAAAAAGGAA AAATAMAARW 4260 
MWAAAAATCT CAATGAAATA TTTCACAAGA AGGAAAAA 

$EQ ID NO:242 PBA7 Protein sequence : 

Protein Accession*: AAF91431 

MFTFLSSVTA AVSGLLVGYE LGIISGALLQ IKTLLALSCH EQEMWSSLV IGALLASLTG 60 
GVUDRYGRR TAIILSSCLL GLGSLVL1LS LSYTVLIVGR IAIGVSISLS SIATCVYIAE 120 
IAPQHRRGLL VSLNELMIVI GILSAYISNY AFANVFHGWK YMFGLV1PLG VLQAIAMYFL 180 
PPSPRFLVMK GQEGAASKVL GRLRALSDTT EELTVIKSSL KDEYQYSFWD LFRSKDNMRT 240 
RIMIGLTLVF FVQITGQPNI LFYASTVLKS VGFQSNEAAS LASTGVGVVK VIST1PATLL 300 
VDHVGSKTFL CIGSSVMAAS LVTMGIVNLN fflMNFTHICR SHNS1NQSLD ESVIYGPGNL 360 
STNNNTLRDH FKGISSHSRS SLMPLRNDVD KRGETTSASL LNAGLSHTEY Q1VTDPGDVP 420 
AFLKWLSLAS LLVYVAAFSI GLGPMFWLVL SEIFPGGIRG RAMALTSSMN WGINLLISLT 480 
FLTVTDLIGL PWVCFIYTIM SLDLIGLPWV CHYT1MSLA SLIFWMFIP ETKGCSLEQI 540 
SMELAKVNYV KNNICFMSHH QEELVFKQPQ KRKPQEQLLE CNKLCGRGQS RQLSPET 



121-339 (underlined sequences correspond to start and slop codons) 



TTTAGCCACC AGAGGANTTC TCTTGAAATA CCCAAAATCC ATCAGTATCT TC 
TGGATTTTGA AGAATTCTTA AG AAGCCATG TAAAGGGGGC TCTCTGGCCT TGAAATAG 1'G 120 

AisrrrrrrA tacagaaagg agaatgcaga atggtcagac tai catgcac tgttaaattt iso 

GATTTCAAGA AATTACAGGA AAACTTTCCA AAGTTCCATC TCACAGAANN TTATTTTNCC 240 
AAGAATTCCA AGATAAGTTT AGTTTTATGG AAGACITTTA TGTGGTTTTT ACTCACTCTT 300 
CATCTCAGAC ATCGACAGAT GATTACATCA CTTATAGTTC TAGTAAATTT ATTAATATAA 360 
AACTCAGAGA CATTCCAATA TCCACATTGC TTACACCATT AGGCATAGAT TCAGTGTCAG 420 
CTATG ACAAT TG AAAATGAG CTGTTTTGTG ATTTAAAGGT TTAAATTTCT CTAACCAAAC 480 
TGCTTGATCC AGATGCAGGA CTGCAAATGT TAATATTTGT TCTGGAAGAA CAATCAAATA 540 
AGACTTAAGA GGAAAGGGAA TGGCCACAAT CCACCTGAAA TTTTTTCTTA AAAAGTGTGC 600 
AGCCTACTAA ATCAGAATGA AAATAGAAGT ACAAGATTAT AAACAAAATG CAATCAAACT 660 
TTTCTTAAGC TTACCTAAAG TTATTTCATC TGAAAATTTC AAGCAACTTT GTTCAACATT 720 
AAATTGACAA TCTAAACTAA CAAGTCTTTT GAATTTATGC ATGGTAGTAA ACATTCTCTC 780 
TATTAACTTT ATTACCTAAG GCTAAACCTA AAATTTTTAA GCAAAATTAG AAAAATAGTC 840 
TTCACTCATC AAAAAATAAA GTTTGTTACA TTTAGTATTT TCCCAATAAA ATTGGTCGTT 900 
CTTGGTTTTT TATTTGG AG A GTCTGTGCA A AATGTCACTA AAAATAAATT AGCACT AG AA 960 
ATTATTTCTA AATACCAAA 

SEQ ID NO:244 PBQ8 DNA SEQUENCE 
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I I I I I I 

AARTGGCGTG CCCGTCTCTC CGCCGGCCCC CTGCCTCGCA GTGGTTTCTC CTGCAGCTCC 
CCTGGGCTCC GCGGCCAGTA GTGCAGCCCG TGGAGCCGCG GCTTTGCCCG TCTCCTCTGG 
GTGGCCCCAG TGCGCGGGCT GACACTCATT CAGCCGGGGA AGGTGAGGCG AGTAGAGGCT 
GGTGCGGAAC TTGCCGCCCC CAGCAGCGCC GGCGGGCTAA GCCCAGGGCC GGGCAGACAA 
AAGAGGCCGC ccgcgtagga aggcacggcc GGCGGCGGCG GAGCGCAGCG ATGGCCGGGC 
G GCTCTGTGCG GGGCACTGGC TGCCTGCGGG TGGCTCCTGG 



AGGGCCGGGA GCTCCTGGTC ATCGAGCTGT CCGACAACCC TGGCGTCCAT GAGCCTGGTG 
AGCCTGAATT TAAATACATT GGGAATATGC ATGGGAATGA GGCTGTTGGA CGAGAACTGC 
TCATTTTCTT GGCCCAGTAC CTATGCAACG AATACCAGAA GGGGAACGAG ACAATTGTCA 
ACCTGATCCA CAGTACCCGC ATTCACATCA TGCCTTCCCT GAACCCAGAT GGCTTTGAGA 
AGGCAGCGTC TCAGCCTGGT GAACTCAAGG ACTGGTTTGT GGGTCGAAGC AATGCCCAGG 
GAATAGATCT GAACCGGAAC TTTCCAGACC TGGATAGGAT AGTGTACGTG A 
AAGGTGGTCC AAATAATCAT C 

CTGTCATTC ATTGGATTAT GGATATTCCT TTTGTGCTTT 

T TTTCCAAAGG TTGGCCCGGG 

CATACTCTTC TTTCAACCCG GCCATGTCTC ACCCCAATCG GCCACCATGT CGCAAGAATG 1200 

ATGATGACAG CAGCTTTGTA GATGGAACCA CCAACGGTGG TGCTTGGTAC AGCGTACCTG 1260 

GAGGGATGCA AGACTTCAAT TACCTTAGCA GCAACTGTTT TGAGATCACC GTGGAGCTTA 1320 

GCTGTGAGAA GTTCCCACCT GAAGAGACTC TGAAGACCTA CTGGGAGGAT AACAAAAACT 1380 

CCCTCATTAG CTACCTTGAG CAGATACACC GAGGAGTTAA AGGATTTGTC CGAGACCTTC 1440 

AAGGTAACCC AATTGCGAAT GCCACCATCT CCGTGGAAGG AATAGACCAC GATGTTACAT 1500 

CCGCAAAGGA TGGTGATTAC TGGAGATTGC TTATACCTGG AAACTATAAA CTTACAGCCT 1560 

CAGCTCOAGG CTATCTGGCA ATAACAAAGA AAGTGGCAGT TCCTTACAGC CCTGCTGCTG 1620 

GGGTTGATTT TGAACTGGAG TCATTTTCTG AAAGGAAAGA AGAGGAGAAG GAAGAATTGA 1680 

TGGAATGGTG GAAAATGATG TCAGAAACTT TAAATTT TTA A AAAGGCTTC TAGTTAGCTG 1740 

CTTTAAATCT ATCTATATAA TGTAGTATGA TGTAATGTGG TCTTTTTTTT AGATTTTG'TC! 1800 

CAGTTAATAC TTAACATTGA TTTATTTTTT AATCATTTAA ATATTAATCA ACTTTCCTTA 1860 

AAATAAATAG CCTCTTAGGT AAAAATATAA GAACTTGATA TATTTCATTO TCTTATATAG 1920 

1980 



TCTAGCTTTC AAAAATTAGT GAAGTTCTTT TACTCTAATT GGTGACAATG TCACATAATG 2100 

AATGCTATTG AAAAGGTTAA CAGATACAGC TCGGAGTTGT GAGCACTCTA CTGCAAGACT 2160 

TAAATAGTTC AGTATAAATT GTCGTTTTTT TCTTGTGOTG ACTAACTATA AGCATGATCT 2220 

TGTTAATGCA TTTTTGATGG GAAGAAAAGG TACATGTTTA CAAAGAGGTT TTATGAAAAG 2280 

AATAAAAATT GACTTCTTGC TTGTACATAT AGGAGCAATA CTATTATATT ATGTAGTCCG 2340 

TTAACACTAC TTAAAAGTTT AGGGTTTTCT CTTGGTTGTA GAGTGGCCCA GAATTGCATT 2400 
CTGAATGAAT AAAGGTTAAA AAAAAATCCC CAGTGAAAAA AAA 



P16870 

MAGRGGS ALL ALCGALAACG WLLGAEAQEF GAPAAGMRRR RRLQQEDGIS EEYHRYPELR 60 
EALVSVWLQC TAISRIYTVG RSFEGRELLV 1ELSDNPGVH EPGEPEEKY1 GNMHGNEAVG 120 
RELLIFLAQY LCNEYQKGNE TIVNLIHSTR FHIMPSLNPD GFEKAASQPG ELKDWFVGRS 180 
NAQGIDLNRN FPDLDR1VYV NEKEGGPNNH LLKNMKKTVD QNTKLAPETK A VrHWIMDIP 240 
FVLSANLHGG DLVANYPYDE TRSGS AHEYS SSPDDAIFQS LARAYSSFNP AMSDPNRPPC 300 
RKNDDDSSFV DGTTNGGAWY SVPGGMQDFN YLSSNCFEIT VELSCEKFPP EETLKTYWED 360 
NKNSLISYLE QIHEGVKGFV RDLQGNPIAN ATIS VEGIDH DVTSAKDGDY WRLLIPGNYK 420 
LTASAPGYLA ITKKVAVPYS PAAGVDFELE SFSERKEEEK EELMEWWKMM SETLNF 



91-1 1 07 (undefined sequence cotresponds to start and stop codon) 



GGGGCGACGT GAGCGCGCAG GGGGGCGGCG GCCTCGCCTC GTOTCTCTCT CTGCGCCTGG 60 

GTCGGGTGGG TGACGCCGAG AGCCAGAGAG ATGTCGGATT TCGACAGTAA CCCGTTTGCC 120 

GACCCGGATC TCAACAATCC CTTCAAGGAT CCATCAGTTA CACAAGTGAC AAGAAATGTT 180 

CCACCAGGAC TTGATGAATA TAATCCATTC TCGGATTCTA GAACACCTCC ACCAGGCGGT 240 

GTGAAGATGC CTAATGTACC CAATACACAA CCAGCAATAA TGAAACCAAC AGAGGAACAT 300 

CCAGCTTATA CACAGATTGC AAAGGAACAT GCATTGGCCC AAGCTGAACT TCTTAAGCGC 360 

CAAGAAGAAC TAGAAAGAAA AGCCGCAGAA TTAGATCGTC GGGAACGAGA AATGCAAAAC 420 

CTCAGTCAAC ATC3GTAGAAA AAATATTTGG CCACCTCTTC CTAGCAATTT TCCTGTCGGA 480 

CCFTGTTTCT ATCAGGAATT TTCTGTAGAC ATTCCTGTAG AATTCCAAAA GACAGTAAAG 540 

CTTATGTACT ACTTGTGGAT GTTCCATGCA GTAACACTGT TTCTAAATAT CTTCGGATGC 600 

TTGGCTTGGT TITGTGTTGA TTCTGCAAGA GCGGTTGATT TTGGATTCAG TATCCTGTGG 660 

TTCTTGCTTT TTACTCCTTG TTCATTTGTC TGTTGGTACA GACCACTTTA TGGAGCTTTC 720 

AGGAGTGACA GTTCATTTAG ATTCTTTGTA TTCTTCTTCG TCTATATTTG TCAGTTTGCT 780 

GTACATGTAC TCCAAGCTGC AGGATTTCAT AACTGGGGCA ATTGTGGTTG GATTTCATCC 840 

CTTACTGGTC TCAACCAAAA TATTCCTGTT GGAATCATGA TGATAATCAT AGCAGCACTT 900 

TTCACAGCAT CAGCAGTCAT CTCACTAGTT ATGTTCAAAA AAGTACATGG ACTATATCGC 960 

ACAACAGGTG CTAGTTTTGA GAAGGCCCAA CAGGAGTTTG CAACAGGTGT GATGTCCAAC 1020 

AAAACTGTCC AGACCGCAGC TGCAAATGCA GCTTCAACTG CAGCATCTAG TGCAGCTCAG 1080 
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rTAAGAA TCTTCAAACA ATACACTGTT ACCTTTTGAC 1140 

TGTACCTTTT TCTCCAGTTA CTGTATTCTA CAAATATTTT TATGTTCAAA ACACACAGTA 1200 

CAGACAGCAT GGATATTTCC TGTTCACTTG TGCATGSGCT AAAACCAGGA AAACTTCCTT 1260 
GTCTTATTAC TTTACCTAAT AGTTTCTTAA TATTTCAGTG C 
ACATGCTAAA TAAATATTCT CCATATTTTT G 
GGTGACCCAC TGAAAATTAA TAATGG1 

CAGTAGTTCT TTCAAGAATC TTTAGAGATA AGGATTGCAC ATTGGAAAAG TAAACCATGT 1500 

T TTTCCCTATT TATATTGAAA GAAATAGGCC AGCAGAGACT TAGGGATTTT 1560 

T GCTTTTTAGC TGTTTCAGTC ACCAGTGAAG AGCCTATGTG CATTTTGTAG 1620 

T AAAATTTGTC ATCTTTTTCT TTTCTTTTTT TTAGAATAGC TGATATTTTG 1680 

T CTAATTTGCA TGGGCACCAC ATTTCTTATA TTAAAAGAAT TAGTGTTTTG 1740 



r TATCCAAAAG AGAAGGTAGT TAATGCAGTA GAAAGTAGTG GTAATAATTC 1920 



?E0IDNO:247PBY4Prplein^ 



MSDFDSNPFA DPDLNNPFKD PSVTQVTRNV PPGLDEYNPF SDSRTPPPGG VKMFNVPNTQ 60 
PAJMKPTEEH PAYTQIAKEH ALAQAELLKR QEELERKAAE LDRREREMQN LS QHGRKNIW 1 20 
PPLPSNFPVG PCFYQEFSVD IPVEFQKTVK LMYYLWMFHA VTLFLNIFGC LAWFCVDSAR 180 
AVDFGLSILW FLLFIPCSFV CWYRPLYGAF RSDSSFRFFV FFFVYICQFA VHVLQAAGFH 240 
NWGNCGWISS LTGLNQNIPV GIMMIHAAL FTASAVISLV MFKKVHGLYR TTGASFEKAQ 300 
QEFATGVMSN KTVQTAAANA ASTAASSAAQ NAFKGNQI 



ATGAGAGACA ATAAATCGTG TGCTTTTTTC ATGGGAAAGT TAAATGTTTG TTTTGAAGGC 60 
ACAGTAATAG CAGGCTATTC. AGTGTTTG CC ACTACCTGCA TCATTCATCT GGCTGTAGCT 120 
AGTGCACTAC AATTTCCTAA A A AGTCTTCT CACCCTCACA GGACTGCTCT ACATCTGGCC 180 
TCTGCCAATG GAAATTCAGA AGTAGTAAAA CTCCTGCTGG ACAGACGATG TCAACTTAAT 240 
ATCCTTGACA ACAAAAAGAG GACAGCTCTG ACAAAGGCCG TACAATGCCA GGAAGATGAA 300 
TGTGCGTTAA TGTTGCTGGA ACATGGCACT GATCCGA ATA TTCCAGATGA GTATGGAAAT 360 
ACCGCTCTAC ACTATGCTAT CTACAATGAA GATAAATTAA TGGCCAAAGC ACTGCTCTTA 420 
TACGGTGCTG ATATCGAATC AAAAAACAAG CATGGCCTCA CACCACTGTT ACTTGGTGTA 480 
CATGAGCAAA AACAGCAAGT GGTGAAATTT TTAATCAAGA AAAAAGCAAA TTTAAATGCA 540 
CTGGATAGAT ATGGAAGGTG TGTGACCTTG GGAACGTTAT TTACCACCAA ATATGTTGTC 600 
ATATATGAAA AG TAG 

$EC|iaNO:24?PpH2Prp|ginseqgencg: 
Protein Accession*: none found 

MRDNKSCAFF MGKLNVCFEG TVIAGYSVFA TTCIIHLAVA SALQFPKKSS HPHRTALHLA 60 
SANGNSEVVK LLLDRRCQLN ILDNKKRTAL TKAVQCQEDE CALMLLEHGT DPNIPDEYGN 120 
TALHYAIYNE DKLMAKALLL YGADIESKNK HGLTPLLLGV HEQKQQVVKF LIKKKANLN A 180 
LDRYGRCVTL GTLFTTKYVV 1YEK 



ATGGTGATCA TCTATCTTTC TTTCTGCAAT TATTACATGG AGTTCTACAG AGAAGAGCTT 60 
CCCCACATTG ACTATTTGAT TGACATTCAG TTTGCAACAG GAAAGGTTAC TCAGCCGGGA 120 
GAGGACACTT CCTACCATCA ATGCGCTCAG CTTGAAGCCA GAGACGAAGG CACCGACAGT 180 
TTATTATTAA ACAATGGCAG CAGCGCCACG CTGAAGACAC GAACGCGCTG TTATGGAACC 240 
CCCAGAGGTC TCCCCCATCG TAGCCTGCTC CAGCCGACTC CGCCCACATG TAAAACGAAG 300 
ATCAGGAGCA GATTTGA AGA ATTACAA AGT G AATTGGTGC CAGTC AGC AT GTCAGAGACA 3 60 
GACCACATAG CCTCTACTTC CTCTGATA AA AATGTTGGGA AAACACCTGA ATTAAAGGAA 420 
GACTCATGCA ACTTGTTTTC TGGCAATGAA AGCAGCAAAT TAGAAAATGA GTCCAAACTA 480 
TTGTCATTAA ACACTGATAA AACTTTATGT CAACCTAATG AGCATAATAA TCGAATTGAA 540 
GCCCAGGAAA ATTATATTCC AGATCATGGT GGAGGTGAGG ATTCTTGTGC CAAAACAGAC 600 
ACAGGCTCAG AAAATTCTGA ACAAATAGCT AATTTTCCTA GTGGAAATTT TGCTAAACAT 660 
ATTTCAAAAA CAAATGAAAC AGAACAGAAA GTAACACAAA TATTGGTGGA ATTAAGGTCA 720 
TCTACATTTC CAGAATCAGC TAATGAAAAG ACTTATTCAG AAAGCCCCTA TGATACAGAC 780 
TGCACCAAGA AATTTATTTC AAAAATAAAG AGCGTTTCAG CATCAGAGGA TTTGTTGGAA 840 
GAAATAGAAT CTGAGCTCTT ATCTACGGAG TTTGCAGAAC ATCGAGTACC AAATGGAATG 900 
AATAAGGGAG AACATGCATT AGTTCTGTTT GAAAAGTGTG TGCAAGATAA ATATTTGCAG 960 
CAGGAACATA TCATAAAAAA GTTAATTAAA GAAAATAAGA AGCATCAGGA GCTCTTCGTA 1020 
GACATTTGTT CAGAAAAAGA CAATTTAAGA GAAGAACTAA AGAAAAGAAC AGAAACTGAG 1080 
AAGCAGCATA TGAACACAAT TAAACAGTTA GAATCAAGAA TAGAAGAACT TAATAAAGAA 1 140 
GTTAAAGCTT CCAGAGATCA ACTAATAGCT CAAGACGTTA CAGCTAAAAA TGCAGTTCAG 1200 
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CAGTTACACA AAGAGATGGC CCAACGGATG GAACAGGCCA ACAAGAAATG TGAAGAGGCA 1260 
CGCCAAGAAA AAGAAGCAAT GGTAATGAAA TATGTAAGAG GTGAGAAGGA ATCTTTAGAT 1320 
CTTCGAAAGG AAAAAGAGAC ACTTGAGAAA AAACTTAGAG ATGCAAATAA GGAACTTGAG 1380 
AAAAACACTA ACAAAATTAA GCAGCTTTCT CAGGAGAAAG GACGGTTGCA CCAGCTGTAT 1440 
5 GAAACTAAGG AAGGCGAAAC GACTAGACTC ATCAGAGAAA TAGACAAATT AAAGGAAGAC 1500 
ATTAACTCTC ACGTCATCAA AGTAAAGTGG GCACAAAACA AATTAAAAGC TGAAATGGAT 1560 
TCACACAAGG AAACCAAAGA TAAACTCAAA GAAACAACAA CAAAATTAAC ACAAGCAAAG 1620 
GAAGAAGCAG ATCAGATACG AAAAAACTGT CAGGATATGA TAAAAACATA TCAGGAGTCA 1680 
GAAGAAATTA AATCAAATGA GCTTGATGCA AAGCTTAGAG TCACAAAAGG AGAACTTGAA 1740 

10 AAACAAATGC AAGAAAAATC TGACCAGCTA GAGATGCATC ATGCCAAAAT AAAGGAACTA 1800 
GAAGATCTGA AGAGAACATT TAAGGAGGGT ATGGATGAGT TAAGAACACT GAGAACAAAG 1860 
GTGAAATGTC TAGAAGATGA ACGATTAAGA ACAGAAGATG AATTATCAAA ATATAAGGAA 1920 
ATTATTAATC GCCAAAAAGC TGAAATTCAG AATTTATTGG ACAAGGTGAA AACTGCAGAT 1980 
CAGCTACAGG AGCAGC1TCA AAGAGGTAAG CAAGAAATTG AAAATTTGAA AGAAGAAGTG 2040 

15 GAAAGTCTTA ATTCTCTGAT TAATGACCTA CAAAAAGACA TCGAAGGCAG TAGGAAAAGA 2100 
GAATCTGAGC TGCTGCTGTT TACAGA AAGG CTCACTAGTA AGAATGCACA GCTTCAGTCT 2160 
GAATCCAATT CTTTGCAGTC ACAATTTGAT AAAGTTTCCT GTAGTGAAAG TCAGTTACAA 2220 
AGCCAGTGTG AACAAATGAA ACAGACAAAT ATTAATTTGG AAAGTAGGTT GTTGA AAG AG 2280 
GAAGAACTGC GAAAAGAGGA AGTCCAAACT CTGCAAGCTG AACTCGCTTG TAGACAAACA 2340 

20 GAAGTTAAAG CATTGAGTAC CCAGGTAGAA G AA'ITAA AAG ATGAGTTAGT AACTCAGAGA 2400 
CGTAAACATG CCTCTAGTAT CAAGGATCTC ACCAAACAAC TTCAGCAAGC ACGAAGAAAA 2460 
TTAGATCAGG TTGAGAGTGG AAGCTATGAC AAAGAAGTCA GCAGCATGGG AAGTCGTTCT 2520 
AGTTCATCAG GGTCCCTGAA TGCTCGAAGC AGTGCAGAAG ATCGATCTCC AGAAA ATACT 2580 
GGGTCCTCAG TAGCTGTGGA TAACTTTCCA CAAGTAGATA AGGCCATGTT GATTGAGAGA 2640 

25 ATAGTTAGGC TGCAAAAAGC ACATGCCCGG AAAAATGAAA AGATAGAATT TATGGAGGAC 2700 
CACATCAAAC AACTGGTGGA AGAAATTAGG AAAAAAACAA AAATAATTCA AAGTTATATT 2760 
TTACGAGAAG AATCAGGCAC ACTTTCTTCA GAGGCATCTG ATTTTAACAA AGTTCATTTA 2820 
AGTAGACGGG GTGGCATCAT GGCATCTTTA TATACATCCC ATCCAGCTGA CAATGGATTA 2880 
ACATTGGAGC TCTCTTTGGA AATCAACCGA AAATTACAGG CTGTTTTGGA GGATACGITA 2940 

30 CTAAAAAATA TTACTTTGAA GGAAAATCTA CAAACACTTG GAACAG AAAT AGAACGTCTT 3000 
ATTAAACACC AGCATGAACT AGAACAGAGG ACAAAGAAAA CCTAAAACAA GCCTCTTGCT 3060 
CAGTAAAGAG ACAAAAGCCA CACAGGAGTA GGTGCCACTG ACCTCTATTG TTGGAGACTT 3120 
TGTTCCACTT TTTGTTTCAG CCAGTAAAAA TATTGTTTTG CTTCATCTGT ACACAAAAAA 3 180 

„ _ ATACCCTTIT ACAATATG AA TGCAITGCTG TATATACTGT AAGACTGAAA GCnTGATGA 3240 

35 AATTTGTTTTTGTATGGTGC AATATGACAG CCTGTCATTG AATCTAAACA ACTTAATTTG 3300 
CTTGTATTCA TAAG AAGTGT TGAACATTAC AAGGGCTTTT AT 



SEQIDNO:251PBJ1 Protein sequence: 
40 Protein Accession*: NPJK0487 

MVIIYLSFCN YYMEFYREEL PHIDYLIDIQ FATGKVTQPG EDTSYHQCAQ LEARDEGTDS 60 
LIXNNGSSAT LKTRTRCYGT PRGLPHRSLL QPTPPTCKTK IRSRFEELQS ELVPVSMSET 120 
DHIASTSSDK N VGKTPELKE DSCNLFSGNE SSKLENESKL LSLNTDKTLC QPNEHNNRIE 180 

45 AQENYIPDHG GGEDSCAKTD TGSENSEQIA NFPSGNFAKH ISKTNETEQK VTQILVELRS 240 
STFPES ANEK TYSESPYDTD CTKKFISKIK SVSASEDLLE EIESELLSTE FAEHRVPNGM 300 
NKGEHALVLF EKCVQDKYLQ QEHUKKLIK ENKKHQELFV DICSEKDNLR EELKKRTETE 360 
KQIIMNTIKQL ESRIEELNKE VKASRDQLIA QDVTAKNAVQ QLHKEMAQRM EQANKKCEEA 420 
RQEKEAMVMK YVRGEKESLD LRKEKETLEK KLRDANKELE KNTNKKQLS QEKGRLHQLY 480 

50 ETKEGETTRL ffiEIDKLKED INSHV1KVKW AQNKLKAEMD SHKETKDKLK ETTTKLTQAK 540 
EEADQKKNC QDMIKTYQES EEIKSNELDA KLRVTKGELE KQMQEKSDQL EMHHAK1KEL 600 
EDLKRTFKEG MDELRTLRTK VKCLEDERLR TEDELSKYKE I1NRQKAEIQ NLLDKVKTAD 660 
QLQEQLQRGK QEtENLKEEV ESLNSUNDL QKDIEGSRKR ESELLLFTER LTSKNAQLQS 720 
ESNSLQSQED KVSCSESQLQ SQCEQMKQTN INLESRLLKE EELEKEEVQT LQAELACRQT 780 

55 EVKALSTQVE ELKDELVTQR RKHASSIKDL TKQLQQARRK LDQVESGS YD KEVSSMGSRS 840 
SSSGSLNARS SAEDRSPENT GSSVAVDNFP QVDKAMUER IVRLQKAHAR KNEKIEFMED 900 
HIKQLVEEK KKTKIIQSYI LREESGTLSS EASDFNKVHL SRRGGMASL YTSHPADNGL 960 
TLELSLEMR KLQAVLEDTL LKNITLKENL QTLGTEIERL IKHQHELEQR TKKT 

60 

?Eq IP KP;2?2PBjBPNA sequence 
Nucleic Acid Accession*: D83760 

Coding sequence: 56-1459 (underlined sequence corresponds lo start and stop codon) 

65 l 11 21 31 41 51 

I I I I I I 

TTGCCGTGAA GGGCTGTGCG GTTCCCGTGC GCGCCGGAGC CTGCTGTGGC CTCTTATGCA 60 

CTCCACCACC CCCATCAGCT CCCTCTTCTC CTTCACCAGC CCCGCAGTGA AGAGACTGCT 120 

AGGCTGGAAG CAAGGAGATG AAGAGGAAAA GTGGGCAGAG AAGGCAGTGG ACTCTCTAGT 180 

/0 GAAGAAGTTA AAGAAGAAGA AGGGAGCCAT GGACGAGCTG GAGAGGGCTC TCAGCTGCCC 240 

GGGGCAGCCC AGCAAATGCG TCACGATTCC CCGCTCCCTG GACGGGCGGC TGCAGGTGTC 3 00 

CCACCGCAAG GGCCTGCCCC ATGTGATTTA CTGTCGCGTG TGGCGCTGGC CGGATCTGCA 360 

GTCCCACCAC GAGCTGAAGC CGCTGGAGTG CTGTGAGTTC CCATTTGGCT CCAAGCAGAA 420 

AGAAGTGTGC ATTAACCCTT ACCACTACCG CCGGGTGGAG ACTCCAGTAC TGCCTCCTGT 480 

ID GCTCGTGCCA AGACACAGTG AATATAACCC CCAGCTCAGC CTCCTGGCCA AGTTCCGCAG 540 

CGCCTCCCTG CACAGTGAGC CACTCATGCC ACACAACGCC ACCTATCCTG ACTCTTTCCA 600 

GCAGCCTCCG TGCTCTGCAO TCCCTCCCTC ACCCAGCCAC GCGTTCTCOC AGTCCCCGTG 660 

CACGGCCAGC TACCCTCACT CCCCAGGAAG TCCTTCTGAG CCAGAGAGTC CCTATCAACA 720 

CTCAGTTGAC ACACCACCCC TGCCTTATCA TGCCACAGAA GCCTCTGAGA CCCAGAGTGG 780 
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CCAACCTGTA GATGCCACAG CTGATAGACA TGTAGTGCTA TCGATACCAA ATGGAGACTT 840 

TCGACCAGTT TGTTACGAGG AGCCCCAGCA CTGGTGCTCG GTCGCCTACT ATGAACTGAA 900 

CAACCGAGTT GGGGAGACAT TCCAGGCTTC CTCCCGAAGT GTGCTCATAG ATGGGTTCAC 960 

CGACCCTTCA AATAACAGGA ACAGATTCTG TCTTGGACTT CTTTCTAATG TAAACAGAAA 1020 

CTCAACGATA GAAAATACCA GGAGACATAT AGGAAAGGGT GTGCACTTGT ACTACGTCGG 1080 

GGGAGAGGTG TATGCCGAGT GCGTGAGTGA CAGCAGCATC TTTGTGCAGA GCCGGAACTG 1140 

CAACTATCAA CACGGCTTCC ACCCAGCTAC CGTCTGCAAG ATCCCCAGCG GCTGCAGCCT 1200 

CAAGGTCTTC AACAACCAGC TCTTCGCTCA GCTCCTGGCC CAGTCAGTTC ACCACGGCTT 1260 

TGAAGTCGTG TATGAACTGA CCAAGATGTG TACTATCCGG ATGAGTTTTG TTAAGGGTTG 1320 

GGGTGCTGAG TATCATCGCC AGGATGTCAC CAGCACCCCC TGCTGGATTG AGATTCATCT 1380 

TCATGGGCCA CTGCAGTGGC TGGACAAAGT TCTGACTCAG ATGGGCTCTC CACATAACCC 1440 
CATTTCTTCA GTGTCTTAAC AGTCATGTCT TAAGCTGCAT TTCCATAGGA T 



SEQ 10 NO:253 PBJ6 Protein sequence: 
Protein Accession*: NPJM5896 

MHSTTP1SSL FSFTSPAVKR LLGWKQGDEE EKWAEKAVDS LVKKLKKKKG AMDELERALS 60 
CPGQPSKCVT 1PRSLDGRLQ VSHRKGLPHV IYCRVWRWPD LQSHHELKPL ECCEFPFGSK 120 
QKEVCINPYH YRRVETPVLP PVLVPRHSEY NPQLSLLAKF RSASLHSEPL MPHNATYPDS 180 
FQQPPCSALP PSPSHAFSQS PCTAS YPHSP GSPSEPESPY QHSVDTPPLP YHATEASETQ 240 
SGQPVDATAD RHVVLSITNG DFRPVCYEEP QHWCSVAYYE LNNRVGETFQ ASSRSVLIDG 300 
FTDPSNNRNR FCLGLLSNVN RNSTIENTRR HIGKGVHLYY VGGEVYAECV SDSSIFVQSR 360 
NCNYQHGFHP ATVCKIPSGC SLKVFNNQLF AQLLAQS Vim GFEWYELTK MCTIRMSFVK 420 
GWGAEYHRQD VTSTPCWIEI HLHGPLQWLD KVLTQMGSPH NPISSVS 



SEQ ID NO:254 PB JB DNA sentience 
Nucleic Acid Accession*: AB04684 

Coding sequence: 472-4377 (underlined sequence corresponds to start and stop codon) 

1 11 21 31 41 51 

I I I I I I 

TGCAGGTTTG CAGGGTCTGA GATTACTTGG GCTTTTCCTG CCTTTTTCTT TTGCTTAAGG 60 

GATGGACAAG GAGCTGAGAT TTATGACCCT TATTAGAGAA AAAAATGTGC CTTGCTAGGG 120 

TGGGGACACT TGGTTGATGC AGTCTCTCTC TCTCTTTCTC GGTGTTTATA ACAAAACAAA 180 

ACCAAAATGA ACTGAGGGGT TTGTAATGGT AGTTTGTTTG TTGCTGGAGA ATGCTACTTT 240 

GCATGCTTTT TTTCTCTTGC AGGGTATGTT CTGTGTTGTG CTTTTTCTTT TAGAAGCTAC 300 

TAAAGGGTGT TGGGGATGCT TCTGACTATT ATGAAGGCCA AAAGGCCTGT TGACTGGGGC 360 

TGCTTTTAAC CCTTTCCTAT TTGCTGAGAA TGCAGCCGTG TGACAGTAAC TGAACATTGG 420 

TCTAAAGTCT TTCCAAAAGG TCAAGGTTCA CAAGAACATC TGCTCAAATT AATGACCATG 480 

GGGGATATGA AGACCCCAGA CTTTGATGAC CTCCTGGCAG CATTTGACAT CCCAGATATG 540 

GTCGATCCTA AAGCAGCTAT TGAGTCTGGA CACGATGACC ATGAAAGCCA CATGAAGCAG 600 

AATGCTCACG GAGAGGATGA CTCCCACGCA CCATCATCTT CTGATGTGGG TGTCAGCGTT 660 

ATCGTCAAGA ATGTTCGGAA CATTGACTCT TCCGAGGGCG GGGAGAAAGA CGGCCACAAC 720 

CCCACTGGCA ATGGCTTACA TAATGGGTTT CTCACAGCAT CCTCCCTTGA CAGTTACAGT 780 

AAAGATGGAG CAAAGTCCTT GAAAGGAGAT GTGCCTGCCT CTGAGGTGAC ACTGAAAGAC 840 

TCGACATTCA GCCAGTTTAG CCCGATCTCC AGTGCTGAAG AGTTTGATGA CGACGAGAAG 900 

ATTGAGGTGG ATGACCCCCC TGACAAGGAG GACATGCGAT CAAGCTTCAG GTCGAATGTG 960 

TTGACGGGGT CGGCTCCCCA GCAGGACTAC GATAAGCTGA AGGCACTCGG AGGGGAAAAC 1020 

TCCAGCAAAA CTGGACTdC TACGTCAGGC AATGTGGAGA AAAACAAAGC TGTTAAGAGA 1080 

GAAACAGAAG CCAGTTCTAT AAACCTGAGT GTTTATGAAC CTTTTAAAGT CAGAAAAGCA 1140 

GAGGATAAAT TGAAGGAAAG CTCTGACAAG GTGC1GGAAA ACAGAGTCCT AGATGGGAAG 1200 

CTGAGCTCCG AGAAGAATGA CACCAGCCTC CCCAGCGTTG CGCCATCAAA GACAAAGTCG 1260 

TCCTCCAAGC TCTCGTCCTG CATCGCTGCC ATCGCGGCTC TCAGCGCTAA AAAGGCGGCT 1320 

TCAGACTCCT GCAAAGAACC AGTGGCCAAT TCGAGGGAAT CCTCCCCGTT ACCAAAAGAA 1380 

GTAAATGACA GTCCGAGAGC CGCTGACAAG TCTCCTGAAT CCCAGAATCT CATCGACGGG 1140 

ACCAAAAAAC CATCCCTGAA GCAACCGGAT AGTCCCAGAA GCATCTCAAG TGAGAACAGC 1500 

AGCAAAGGAT CCCCGTCCTC TCCCGCAGGG TCCACACCAG CAATCCCCAA AGTCCGCATA 1560 

AAAACCATTA AGACATCTTC TGGGGAAATC AAGAGAACAG TGACCAGGGT ATTGCCAGAA 1620 

GTGGATCTTG ACTCTGGAAA GAAACCTTCC GAGCAGACAG CGTCCGTGAT GGCCTCTGTG 1680 

ACATCCCTTC TGTCGTCTCC AGCATCAGCC GCCGTCCTTT CCTCTCCCCC CAGGGCGCCT 1740 

CTCCAGTCTG CGGTCGTGAC CAATGCAGTT TCCCCTGCAG AGCTCACCCC CAAACAGGTC 1800 

ACAATCAAGC CTGTGGCTAC TGCTTTCCTC CCAGTGTCTG CTGTGAAGAC GGCAGGATCC 1860 

CAAGTCATTA ATTTGAAGCT CGCTAACAAC ACCACGGTGA AAGCCACGGT CATATCTGCT 1920 

GCCTCTGTCC AGAGTGCCAG CAGCGCCATC ATTAAAGCTG CCAACGCCAT CCAGCAGCAA 1980 

ACTGTCGTGG TGCCGGCATC CAGCCTGGCC AATGCCAAAC TCGTGCCAAA GACTGTGCAC 2040 

CTTGCCAACC TTAACCTTTT GCCTCAGGGT GCCCAGGCCA CCTCTGAACT CCGCCAAGTG 2100 

CTAACCAAAC CTCAGCAACA AATAAAGCAG GCAATAATCA ATGCAGCAGC CTCGCAACCC 2160 

CCCAAAAAGG TGTCTCGAGT CCAGGTGGTG TCGTCCITGC AGAGTTCTGT GGTGGAAGCT 2220 

TTCAACAAGG TGCTGAGCAG TGTCAATCCA GTCCCTGTTT ACATCCCAAA CCTCAGTCCT 2280 

CCCGCCAATG CAGGGATCAC GTTACCGACG CGTGGGTACA AGTGC TTGGA GTGTGGGGAC 2340 

TCCTTTGCAC TTGAAAAGAG TCTGACCCAG CACTACGACA GACGGAGCGT GCGCATCGAA 2400 

GTAACGTGCA ACCATTGTAC AAAGAACCTC GTTTTTTACA ACAAATGCAG CCTCCTTTCC 2460 

CATGCCCGTG GGCATAAGGA GAAAGGGGTG GTAATGCAAT GCTCCCACTT AATTTTAAAG 2520 

CCAGTCCCAG CAGATCAAAT GATAGTTTCT CCGTCAAGCA ATACTTCCAC TTCAACTTCC 2580 

ACTCTTCAGA GCCCTGTGGG AGCTGGCACA CACACTGTCA CAAAAATTCA GTCTGGCATA 2640 

ACTGGGACAG TCATATCGGC TCCTTCAAGC ACTCCCATCA CCCCAGCCAT GCCCCTAGAT 2700 

GAAGACCCCT CCAAACTGTG TAGACATAGT CTAAAATGTT TGGAGTGTAA TGAAGTCTTC 2760 

CAGGACGAGA CATCACTGGC TACACATTTC CAGCAGGCTG CAGATACGAG TGGACAAAAG 2820 
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AC TTGCACTA TCTGCCAGAT GCTGCTTCCT AACCAGTGCA GTTATSCATC ACACCAGAGA 2880 

ATCCATCAGC ACAAATCTCC CTACACCTGC CCTGAGTGTG GGGCCATCTG CAGGTCGGTG 2940 

CACTTCCAGA CCCACGTCAC CAAGAACTGT CTGCACTACA CGAGGAGAGT TGGTTTTCGA 3000 

TGTGTGCATT GCAATGTTGT GTACTCTGAT GTGGCTGCTC TGAAGTCTCA CATTCAAGGT 3060 

TCICACTGTG AAGTCTTCTA CAAGTGTCCT ATTTGTCCAA TGGCGTTTAA GTCTGCCCCA 3120 

AGCACACATT CCCACGCCTA CACACAGCAT CCTGGCATCA AGATAGGAGA ACCAAAAATA 3180 



TTTGACCAAC ACATTGAAAA CCAGAAGGTG TCTGTTTTCA AGTGTCCAGA CTGTTCTCTT 3300 

TTATATGCAC AGAAGCAACT TATGATGGAC CATATCAAGT CTATGCATGG AACATTGAAA 3360 

AGTATTGAAG GGCCTCCAAA CTTGGGTATA AACTTGCCTT TGAGCATTAA GCCTGCAACT 3420 

CAAAATTCAG CAAATCAGAA CAAAGAGGAC ACCAAATCCA TGAATGGGAA AGAGAAATTG 3480 

GAAAAGAAAT CTCCATCTCC TGTGAAAAAA TCAATGGAAA CCAAGAAAGT GGCCAGTCCT 3540 

GGGTGGACGT GTTGGGAGTG TGACTGCCTG TTCATGCAGA GAGATGTGTA CATATCCCAC 3600 

GTGAGGAAGG AGCACGGGAA GCAAATGAAG AAACACCCCT GCCGCCAGTG TGACAAGTCT 3660 

TTCAGCTCGT CCCACAGCCT GTGCOGGCAC AACCGGATCA AGCACAAAGG CATCAGGAAA 3720 
GTGTACGCCT GCTCGCACTG CCCAGACTCC AGACGT2 
GAGAAGCACG TCCAGCTGAT GCATGGCATC AAGGACCCTG A 

GCCACCAATG AGGAGGAAAC AGAAATAAAA GAAGACACTA AGGTCCCCAG TCCCAAGCGG 3900 

AAGTTGGAAG AACCAGTTCT GGAGTTCAGG CCTCCCCGAG GAGCAATCAC TCAACCACTG 3960 

AAAAAGCTGA AAATCAATGT TTTTAAGGTT CACAAGTGTG CCGTGTGTGG CTTCACCACC 4020 

GAAAACCTGC TCCAATTCCA CGAACACATC CCTCAGCACA AATCGGATGG TTCTTCCTAC 4080 

CAGTGCCGGG AGTGTGGCCT CTGCTACACG TCTCACGTCT CTCTGTCCAG GCACCTCTTC 4140 

ATCGTACACA AGTTAAAGGA ACCTCAGCCA GTGTCCAAGC AAAATGGGGC TGGGGAAGAT 4200 

AACCAACAGG AGAACAAACC CAGCCACGAG GATGAATCCC CTGATGGCGC CGTGTCAGAC 4260 

AGAAAGTGCA AAGTGTGCGC AAAAACTTTT GAAACTGAAG CTGCCTTAAA TACTCACATG 4320 

CGGACACACG GCATGGCCTT CATCAAATCC AAAAGGATGA GCTCAGCCGA GAAATAGCCA 4380 

CAGATGCTCC ATGAGGAAAA TCCCTGTCCA CATTGGAATA AAAAAGACAT TTTTGTTACA 4440 

AAGTTTGCAG TATAATAGAG TTAACAGTAC TGTCTAGGCT GTTGCAATAT ATTCTCTTTC 4500 

AATGTACCTT CCTTCACCTC GTCGTATATA TCCTCGATAA GTATTAAAAC AGTATTTGAG 4560 

TTTAAAAGAG TTTGTATATA TTTAAATGAA TAACTTTTTA TACTCTTTGT TACATGOTTG 4620 

TATCAGTATT TAGTGGAAAA CCATTTGAGT TGTTTTGGGT TAGAATTTTT CTTTTTGTAC 4680 

TGTTTCTTTA AAACAGAGTT CTTAGTAACA GGGGCAGTTC CTGAATTCAA ATAAACCATT 4740 

TTGTATGTTT GGATTTTGAA TGGGTTAACT AATTACAGGC TAAAATAATG CCTTTTTTAG 4800 

TGTTTTTAAT TTTTAGAATT CACTACATAA ATTGTAAGTA ATTGTGGGTC TCAAAAACAC 4860 

TAGGAACTTT TAAGTGTCTT AGCACTTCCT CGATGTGCCT GCCCTGAGGG AGTGAGTTCA 4920 

CATTTCAGAC AACTGCACTC CAGTGTGGAC GTGCCTTTCT CTTCAGGCCA TGCCGAAGGG 4980 

TGTTTAAAGC AGTCTTGCAG GTCGCTCCTT TCCCAGCCGT GGATAAAAAC TGAAGCTAGG 5040 
AATCTAATAA GGAATGCTGA TTTCCTCAGT TCCATTT 



GGAGGGAAAG TAACGAAAGG GCTGGACTAC TATAAAAGTT ACAAATACGT AGTTAGACCA 5220 

ATAGATTTAT ATAGTCAGGT TTTTGTCATG TAATTTATTA ACTAACTATT ACAGAAACAC 5280 

AGCTAAGAAT ATCAAGTATT TCTCTGGCTC TTGACAGAAA AAAATCAGTT GACTTAACCC 5340 

TTAGAGTCGG GATGCACAAC TTCAACCACC GACTTATCAA TGCAGCCGCC TGTGTATTGC 5460 

AMTGGCCGT TACCTTAAGC ACTGAGCCAC CCGGGTTTAG TTCAGCCATT TCAAGAAGTA 5520 

TATTTAACGT CGGTAGTTCT GCTTTATTAA AATGCAGCAG AGGTACTCTT CTGTCCCTTC 5580 

CGTTTATAGT TCTCTGAGAG AGTTCTATTT TTTGGTTTTG TTTTGTGTTT TCTTTTGCAT 5640 

TTTGTATCTT GTATTTATCC CTGAACATGT TTTGTACCTT TTTTTTTTTT TTTTTTTTAA 5700 

GAAAAGGAA? TCTTTTGTGT ATATATAGAT ACTTGCATGA TATACTGTAG TCAATGTTCG 5760 

GTTCCTCAAA AGGTCTTGCT GCTGTCAGGT GTTATGCACT CCATCCATCA TAACTGTATG 5820 
AAACACATTT CATATGTAAA TAAACGTGGG ACATTTG 

?5PW8 Protein sgfflfl&S 



Protein Accession*: BAB13455 

MKTPDFDDLL AAFDIPDMVD PKAAIESGHD DHESHMKQNA HGEDDSHAPS SSD VGVSVIV 60 
KNVRNIDSSE GGEKDGHNPT GNGLHNGFLT ASSUDSYSKD GAKSLKGDVP ASEVTLKDST 120 
FSQFSP1SSA EEFDDDEKIE VDDFPDKEDM RSSFKSN VLT GSAPQQDYDK LKALGGENSS 180 
KTGLSTSGNV EKNKAVKRET EASS1NLSVY EPFKVRKAED KLKESSDKVL ENRVLDGKLS 240 
SEKNDTSLPS VAPSKTKSSS KLSSC1AAIA ALSAKKAASD SCKEPVANSR ESSPLPKEVN 300 
DSPRAADKSP ESQNL1DGTK KPSLKQPDSP RS1SSENSSK GSPSSPAGST PA1PKVR1KT 360 
IKTSSGEIKR TVTRVLPEVD LDSGKKPSEQ TASVMASVTS LLSSPASAAV LSSPPRAPLQ 420 
SAWTNAVSP AELTPKQVTI KPVATAFLPV SAVKTAGSQV INLKLANNTT VKATVISAAS 480 
VQSASSAI1K AANAIQQQTV WPASSLANA KLVPKTVHLA NLNLLPQGAQ ATSELRQ VLT 540 
KPQQQKQAI INAAASQPPK KVSRVQVVSS LQSSVVEAFN KVLSSVNPVP VYIPNLSPPA 600 
NAGITLPTRG YKCLECGDSF ALEKSLTQHY DRRSVREVT CNHCTKNLVF YNKCSLLSHA 660 
RGHKEKGVVM QCSHLILKPV PADQMIVSPS SNTSTSTSTL QSPVGAGTHT VTKIQSGITG 720 
TVISAPSSTP ITPAMPLDED PSKLCRHSLK CLECNEVFQD ETSLATHFQQ AADTSGQKTC 780 
TICQMIXPNQ CS YASHQRIH QHKSPYTCPE CGAICRS VHF QTHVTKNCLH YTRRVGFRCV 840 
HCNVV YSDVA ALKSHIQGSH CEVFYKCPIC PMAFKS APST HSHAYTQHPG KIGEPK1IY 900 
KCSMCDTVFT LQTLLYRHFD QHffiNQK VSV FKCPDCSLLY AQKQLMMDHI KSMHGTLKSI 960 
EGPPNLGINL PLSIKPATQN SANQNKEDTK SMNGKEKLEK KSPSPVKKSM ETKKVASPGW 1020 
TCWECDCLFM QRDVYISHVR KEHGKQMKKH PCRQCDKSFS SSHSLCRHNR IKHKGIRKVY 1080 
ACSHCPDSRR TFTKRLMLEK HVQLMHGIKD PDLKEMTDAT NEEETEIKED TKVPSPKRKL 1140 
EEPVLEFRPP RGA1TQPLKK LKINVFKVHK CAVCGFTTEN LLQFHEHIPQ HKSDGSSYQC 1200 
RECGLCYTSH VSLSRHLFTV HKLKEPQPVS KQNGAGEDNQ QENKPSHEDE SPDGAVSDRK 1260 
CKVCAKTFET EAALNTHMRT HGMAFIKSKR MSSAEK 



408 



WO 02/30268 



PCT/US01/32045 



SEQIPNO:i>56PBM1 PNAseouenoe 
Nucleic Acid Accession*: AF1 11847 

Coding sequence: 58-1608 (underlined sequence corresponds lo start and slop codon) 



AACAAGGTGT GTTTTGATTG TGGTGCCAAA AATCCCAGCT GGGCAAGCAT AACCTATGGA 
GTGTTCCTTT GCATTGATTG CTCAGGGTCC CACCGGTCAC TTGGTGTTCA C 
ATTCGATCTA CAGAGTTGGA TTCCAACTGG TCATGGTTTC AGTTGCGATG 

GGAGGAAACG CTAGTGCATC TTCCTTTTTT CATCAACATG GGTGTTCCAC CAATGACACC 3 GO 

AATGCCAAGT ACA&CAGTCG TGCTGCTCAG CTCTATAGGG AGAAAATCAA ATCGCTCGCC 420 

TCTCAAGCAA CACGGAAGCA TGGCACTGAT CTGTGGCTTG ATAGTTGTGT GGTTCCACCT 4 BO 

TTGTCCCCTC CACCAAAGGA GGAAGATTTT TTTGCCTCTC ACGTTTCTCC TGAGGTGAGT 540 

GACACAGCGT GGGCATCAGC AATAGCAGAA CCATCTTCTT TAACATCAAG GCCTC3TGGAA 600 

ACCACTTTGG AAAATAATGA AGGTGGACAA GAGCAAGGAC CAAGTGTGGA AGGTCTTAAT 660 

GTACCAACAA AGGCTACTTT AGAGGTATCC TCTATCATAA AAAAGAAACC AAATCAAGCT 720 

AAAAAAGGCC TTGGGGCCAA AAAAGGAAGT TTGGGAGCTC AGAAACTGGC AAACACATGC 7B0 

TTTAATGAAA TTGAAAAACA AGCTCAAGCT GCGGATAAAA TGAAGGAGCA GGAAGACCTG 840 

GCCAAGGTGG TATGTAAAGA AGAATCAATT GTTTCATCAT TACGATTAGC CTATAAGGAT 900 

CTTGAAATTC AAATGAAGAA AGACGAAAAG ATGAACATTA GTGGCAAAAA AAATGTTGAC 960 

TCAGACAGAC TCGGCATGGG ATTTGGAAAT TGCAGAAGTG TTATTTCACA TTCAGTGACT 1020 

TCAGATATGC AGACCATAGA GCAGGAATCA CCCATTATGG CAAAACCAAG AAAAAAGTAT 1080 

AATGATGACA GTGACGATTC ATATTTTACT TCCAGCTCAA GTTACTTTGA CGAGCCAGTG 1140 

GAGTTAAGGA GCAGTTCTTT CTCTAGCTGG GATGACAGTT CAGATTCCTA TTGGAAAAAA 1200 

GAGACCAGCA AAGATACTGA AACAGTTCTG AAAACCACAG GCTATTCAGA CAGACCTACT 1260 

GCTCGCCGCA AGCCAGATTA TGAGCCAGTT GAAAATACAG ATGAGGCCCA GAAGAAGTTT 1320 

GGCAATGTCA AGGCCATTTC ATCAGATATG TATTTTGGAA GACAATCCCA GGCTGATTAT 13 80 

GAGACCAGGG CCCGCCTAGA GAGGCTGTCG GCAAGTTCCT CCATAAGCTC GGCTGATCTG 1440 

TTCGAGGAGC CGAGGAAGCA GCCAGCAGGG AACTACAGCC TGTCCAGTGT GCTGCCCAAC 1500 

GCCCCCGACA TGGCGCAGTT CAAGCAGGGA GTGAGATCGG TTGCTGGAAA ACTCTCCGTC 1560 

TTTGCTAATG GAGTCGTGAC TTCAATTCAG GATCGCTACG GTTC TTAA TA CTGAAGTCAT 1620 

GATGTGTATT TCCTGGAGAA ATTCCTCTTT AAATGAACAA GTAACCACAT CTCAGGCGGC 1680 

AGTGAAGTCC AGATAGTTTT GCAGATTGTT TTGCTACTTT TTCATATGGT ATATGTTTCT 1740 

GATTTTTAAT ATTTCTTTTG AGAAATTCTG AGTTCTGATG TAGGAGCTTT CCTGTGATTT 1800 

ATTTTCTTGG AACCTTTGAT TTCAACACTG AGGGCCTGGA GACCTCGGCT CCTCCTGCTC 1920 

CTGAACCAGG AGGCTTCATG TGGGGGAGGA GGAGAGGTCT CCATGTGACA CATGGGCTCA 1980 

GGGCTGCCAG AATCAGCGGA TGCTGGATGG GCCTGCAGAA ACAACACTCA CCACACACAC 2040 

TTCCTTCAAA AGACCAAAAG TGACTGGTGT CTCGTGTGAC AGATTGCTTC ATTTATGTTT 2100 

CTACATAGTA AGGTGACTGC CAAATAATAT TTGAAGTCAT CTGTCTCTTT GTAAATTATT 2160 

TTATATGACC TATAAATTTA AAAATGTTTT TCAGTGAGTG CTTTTAACAA ACTTAAGCTT 2220 

CTGCCCTGCC AAGGGAATTA ATGTTATCTT GTGAAAGGTG TTGCTGTTTG AATTGATGAG 2280 

AAATGGAAGA TGAGAACTCC CTAAGAGTTC TCATAATAAA TCATCTCATC ACAAATCAAT 2340 

ACGGTATACA GAGTTAAAGT GGAATCAGGT AAGAAGATAC AGCTACAGAA AATAGTTGCG 2400 

TGTATGGGAG AACAGTCATT GTAATTGGGT AGTTTTGTTA ATAAATATTT TTAAATCTTG 2460 

CTTTTCAGAA ATTACCGAAT GTGTATAAAC AAATAAAGAA AAATAATTTA GCTGTGTTTT 2520 
AGACAGCATT AGAATATATT GTTCAGCACA GTAAAATATA TT 



CTAAATAGAA TAAACACTTG CAGCAGAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2700 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2760 
AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



MGDPSKQDIL T1FKRLRSVP TNKVCFDCGA KNPSWASLTY GVELCIDCSG SHRSLGVHLS 60 
FKSTELDSN WSWFQLRCMQ VGGNASASSF FHQHGCSTND TNAK.YNSRAA QLYREK1KSL 1 20 
ASQATRKHGT DLWLDSCVVP PLSPPPKEED FFASHVSPEV SDTAWASAIA EPSSLTSRPV 1 80 
ETTLENNEGG QEQGPSVEGL NVPTKATLEV SSIIKKKPNQ AKKGLGAKKG SLGAQKLANT 240 
CFNBEKQAQ AADKMKEQED LAKWSKEES IVSSLRLAYK DLEIQMKKDE KMNISGKKNV 300 
DSDRLGMGFG NCRS VISHS V TSDMQT1EQE SPIMAKPRKK YNDDSDDS YF TSSSS YFDEP 360 
VELRSSSFSS WDDSSDSY WK KETSKDTETV LKTTGYSDRP TARRKPDYEP VENTDEAQKK 420 
FGNVKAISSD MYFGRQSQAD YETRARLERL SASSSKSAD LFEEPRKQPA GNYSLSSVLP 480 
NAPDMAQFKQ GVRSVAGKLS VFANGWTSI QDRYGS 

SEQ ID NO:258 PBM4 DNA secuence 
Nucleic Acid Accession*: O30891 

Coding sequence: 1-4032 (underlined sequence corresponds to start and stop codon) 

ATGGATACTG TCATGAAGCA GACACATGCT GACACACCTG TTGATCATTG TCTATCTGGC 60 
ATAAGAAAGT GTAGCAGCAC CTTTAAGCTT AAAAGTGAAG TCAACAAGCA TGAAACAGCC 120 
CTTGAAATGC AGAATCCAAA TTTGAACAAT AAAGAATGTT GTITCACCTT TACGTTG AAT 180 
GGAAACTCCA GAAAATTAGA CCGTAGTGTG TTTACAGCAT ATGGTAAACC CAGCGAGAGT 240 
ATCTACTCAG CCCTGAGTGC TAATG ACTAT TTCAGTGAAA GGATAAAGAA TCAGTTTAAT 300 
AAGAACATTA TTGTTTATG A AG AAAAGACA ATAGATGG AC ATATAAATTT AGGAATGCCT 360 
CTCAAGTGCC TGCCTAGTGA TTCTCATTTT AAAATTACAT TTGGTCAAAG AAAGAGTAGC 420 
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AAAGAAGATG G AC ACATATT ACGCCA ATGT G AAAATCC AA ACATGG AATG CATTCTTTTT 480 
CATGTTGTTG CTATAGGAAG GACAAGAAAG AAGATTGTTA AGATCAACGA ACTTCATGAA 540 
AAAGGAAGTA AACTTTGTAT TTATGCCTTG AAGGGTGAGA CTATTGAAGG AGCCTTATGC 600 
AAGGATGGCC GTTTTCGGTC TG ACATAGGT GAATTTGAAT GGAAACTAAA GGAAGGTCAT 660 
5 AAGAAAATTT ATGGAAAACA GTCCATGGTG GATGAAGTAT CTGGAAAAGT CTTAGAAATG 720 
GACATTTCAA AAAAAAAAGC ATTACAACAG AAAGATATCC ATAAAAAAAT TAAACAGAAT 780 
GAAAGTGCCA CTGATGAAAT TAATCACCAG AGTCTGATAC AGTCTAAGAA AAAAGTCCAC 840 
AAACCAAAGA AAGATGGAGA GACCAAAGAT GTAGAACACA GCAGAGAGCA AATTCTGCCA 900 
CCTCAGGATC TAAGCCATTA TATTAAAGAT AAAACTCGCC AGACAATTCC CAGGATTAGA 960 

10 AATTATTACT TTTGTAGTTT GCCCCGAAAA TATAGGCAAA TAAACTCACA AGTTAGACGG 1020 
AGGCCGCATC TGGGTAGGCG GTATGCTATT AATCTGGATG TCCAAAAGGA GGCAA1TAAT 1080 
CTCTTAAAGA ATTATCAAAC GTTGAATGAA GCCATAATGC ATCAGTATCC GAATTTTAAA 1140 
GAGGAGGCAC AGTGGGTAAG AAAATATTTT CGGGAAGAAC AAAAGAGAAT GAATCTTTCA 1200 
. _ CCAGCTAAGC AATTCAACAT ATATAAAAAG GACTTCGGAA AAATGACTGC AAATTCTGTT 1260 

15 TCAGTTGCAA CCTGCGAACA GCTTACATAT TATAGCAAGT CAGTTGGGTT CATGCAATGG 1320 
GACAATAATG GAAACACAGG TAATGCTACT TGCTTTGTCT TCAATGGTGG ITATATTTTC 1380 
ACCTGTCGAC ATGTTGTACA TCTTATGGTG GGTAAAAACA CACATCCAAG TTTGTGGCCA 1440 
GATATAATTA GCAAATGTGC GAAGGTAACC TTCACTTATA CAGAGTTCTG CCCTACTCCT 1500 
GACAATTGGT TTTCCATTGA GCCATGGCTT AAAGTGTCCA ATGAAAATCT AGATTATGCC 1560 

20 ATTTTAAAAC TAAAAGAAAA TGGAAATGCG TTTCCTCCAG GACTATGGCG ACAGATTTCT 1620 
CCTCAACCAT CTACTGGTTT GATTTATTTA ATTGGTCATC CTGAAGGCCA GATCAAGAAA 1680 
ATAGATGGTT GTACTGTGAT TCCTCTAAAC GAACGATTGA AAAAATATCC AAACGATTGT 1740 
CAAGATGGGT TGGTAGATCT CTATGATACC ACCAGTAATG TATACTGTAT GTTTACCCAA 1800 
AGAAGTTTCC TATCAGAGGT TTGGAACACA CACACGCTTA GTTATGATAC TTGTTTCTCT 1860 

25 GATGGGTCCT CAGGCTCCCC AGTGTTTAAT GCATCTGGCA AATTGGTTGC TTTGCATACC 1920 
TTTGGGCTTT TTTATCAACG AGGATTTA AT GTGCATGCCC TTATTG AATT TGGTTATTCT 1980 
ATGGATTCTA TTCTTTGTGA TATTAAAAAG ACAAATGAGA GCTTGTATAA ATCATTAAAT 2040 
GATGAGAAAC TTG AGACCTA CGATG AAG AG AAAGCCCGGC CCAGGCCAGC CTACCGGCG A 2100 

_ CTAGGATGCT TTCGCTTTCG CTCTCGCTTT CCAATACTCG GGACTGGGGA AACCGGGAGA 2160 

30 ATAGAAGCAG GCAAGGACCG CCGTGGGCAC GGGGTCAGTG AGACAGGGTC CTGCTCGCGG 2220 
CGTCAAGGAG GAGCGCTGTG GGTGTCCCCA GCGCAGCCAA TCGGCTTCCG AAGTAGCTGG 2280 
AGCTCTGGAG CCTTTGCTTC CTCAAATACG AGCGGGAACT GCGTTGAGCG CTGGATTCC A 2340 
GGCCGAGTGC TGGCGAGGCG CGCAGTCTCT AAAGAGCAAC AGAATAATTG CAGTACTTCT 2400 
CTAATGAGGA TGGAGTCTAG AGGAGACCCA AGAGCCACAA CTAATACCCA GGCTCAAAGA 2460 

35 TTCCATTCAC CTAAGAAAAA TCCAGAAGAC CAGACCATGC CCCAAAATAG GACAATATAT 2520 
GTTACCTTGA AGG CTGTC AG AAAAG AG ATA GAAACTCACC AAGGCCAAGA AATGCTTGTG 2580 
CGTGGCACAG AAGGAATCAA AGAGTACATA AACCTTGGAA TGCCCCTCAG TTGTTTCCCT 2640 
GAAGGTGGCC AGGTGGTCAT TACATTTTCC CAAAGTAAAA GTAAGCAGAA GGAAGATAAC 2700 

A CACATATTTG GCAGGC AGGA CAAAGCATCG ACTGAATGTG TCAAATTTTA CATTCATGCA 2760 

40 ATTGGAATTG GGAAGTGTAA AAGAAGGATT GTTAAATGTG GGAAGCTTCA CAAAAAGGGG 2820 
CGCAAACTCT GTGTTTATGC TTTCAAAGGA GAAACCATCA AGGATGCACT GTGCAAGGAT 2880 
GGCAGATTTC TTTCCTTTCT GGAGAATGAT GATTGGAAAC TCATTGAAAA CAATGACACC 2940 
ATTTTAGAAA GCACCCAGCC AGTTGATGAA TTAGAAGGCA GATACTTTCA GGTTGAGGTT 3000 
. , GAGAAAAGAA TGGTCCCCAG TGCAGCAGCT TCTCAGAATC CTGAGTCAGA GAAAAGAAAC 3060 

45 ACCTGTGTGT TGAGAGAACA AATCGTGGCT CAGTACCCCA GTTTGAAAAG AGAAAGTGAA 3 120 
AAAATCATTG AAAACTTCAA GAAAAAAATG AAAGTAAAAA ATGGGGAAAC ATTATTTGAA 3180 
TTGCATAGAA CAACGTTTGG GAAAGTAACA AAAAATTCTT CTTCGATTAA AGTAGTGAAA 3240 
CTTCTTGTAC GTCTCAGTGA CTCAGTTGGG TACTTATTCT GGGACAGTGC AACTACGGGT 3300 

„ TACGCCACCT GCTTTGTTTr TAAAGGATTG TTCATTTTAA CTTGTCGGCA TGTAATAGAT 3360 

50 AGCATTGTGG GAGACGGAAT AGAGCCAAGT AAGTGGGCAA CCATAATTGG TCAATGTGTA 3420 
AGGGTGACAT TTGGTTATGA AGAGCTAAAA GACAAGGAAA CAAACTACTT TTTTGTTGAA 3480 
CCTTGGTTTG AGATACATAA TGAAGAGCTT GACTATGCTG TCCTGAAACT GAAGGAAAAT 3540 
GG AC A AC AAG TACCTATGGA ACTATATAAT GGAATTACTC CTGTGCCACT TAGTGGGTTG 3600 
ATACATATTA TTGGCCATCC ATATGGAGAA AAAAAGCAGA TTG ATGCTTG TGCTGTGATC 3660 

55 CCTCAGGGTC AGCGAGCAAA GAAATGTCAG GAACGTGTTC AGTCTAAAAA AGCAGAAAGT 3720 
CCAGAGTATG TCCATATGTA TACTCAAAGA AGTTTCCAGA AAATAGTTCA CAACCCTGAT 3780 
GTGATTACCT ATGACACTGA ATTTTTCTTT GGGGCTTCCG GCTCCCCTGT GTTTGATTCA 3840 
AAAGGTTCAT TGGTGGCCAT GCATGCTGCT GGCTTTGCTT ATACTTACCA AAATGAGACT 3900 
_ _ CGTAGTATCA TTGAGTTTGG CTCTACCATG GAATCCATCC TCCTTGATAT TAAGC AAAGA 3960 

60 CATAAACCAT GGTATGAAG A AGTATTTGTA AATCAGCAGG ATGTAGAAAT GATGAGTGAT 4020 
GAGGACTTGTJJAGAATTCAG TCTACTGGAT TTAAGGGAAT GGCTTATGGA GTTGTTATTT 4080 
CGTAGGCATT GAAAATGGTT TTCTAAACTC CAAAATGGTC ATCTTATCAA TAATAATAAT 4140 
ATTGACCATT TCCTATCTGC CAGGCATTTT TCTAAGCACA TGAAGAAATT AGTCCTAACA 4200 

r , ACACTATGAG ATGGACTATA ACTTGCCCAA ATTTTTTTTT TTTTTGAGAC TGAGTCTCAC 4260 

65 TCTGTCGCCT GGGCTGGAGT ACAGTGGTGC GATCTCAGCT CACTGCAACT TCCACCTCCC 4320 
AGGTTCAAGC GATTCTTATG CCTC AGTCTC CTGAGCAGCT GGGATTACAG GCAAACGCCA 4380 
CCACACCCAG CTA AATTTTT TTTTTTTTTT TGTATTTTTA GTAGAGACAG GGTTTCACCA 4440 
TGTTGGTCAG GCGGGTCTCG AACTCCTGAC CTCGTGATCC ACCTGCCTCG GCCTTCCAAA 4500 

_ GTGCTGGGAT TACAAGTTTG AGCCACTGCA CCTGGCTAAC TTGCCCTATT TTAAAGTCAA 4560 

70 GCAATGGGAA GAATAACAAG ATTATATAGT AATCAGTTTC ATGACACTAA AAGTCATATA 4620 
GTCATAGGGT TTTTTCATCT TTCATATCTT TGCCTAAATT CATTTGCTAC AGTGCAGGAA 4680 
CCAAAACTTG TTCATCTCAT GATTCCCTAC ATCTGACATA AGGAAAGTAA GTGCTCAGAA 4740 
AAATGTGCAG GTCAATAAGT TGCAAAAGTT GGGGCTGCAA TTAATGCTAA CATAAGAGCT 4800 

_ AAATGCTTGA TTAGAAATG A TCTCAAAACC TTTTAGAATT TCCAAAATCT TCATATTACT 4860 

75 GAAACTGTCG GAATATATGG GTCCTGAAAT TCAGAAGATG ATAGTCACTC TTCCCATATT 4920 
TATAGGCTAT TAAGGCAAGG GATATCTTAA ACATCATATT ACTTTATTTA GATTTCTACT 4980 
ACTCCAATTA TTAATGTTAT GTATTTCTCA TTGTTTTACT TCTTCATGGT ATTATGAAGA 5040 
CTATATAGAT GATTCAACCA AGCCTGCAAA TCTCCCTCTT GTGGAATTCC ACTGGACCCA 5100 
ATCTGTTTTC CATTTCCATT GCAATACTAC TAAAGCCATA CAATATCAAG CACCCTCCCT 5160 
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CTAGGTCCAG GGACTATCAC AGAAGAAGCA GGCATGTAAG ATTTTAAGGA CTGGTTTCGA 5220 
GGGGTCGAGT GTAGGAAAAC AGCCTGTTGC ATTGTAAGAG TGATGTCACC TTGAAGAGCA 5280 
GCTGGCATGA TGACTGCTGT TTGACTCCTG CATACCAAGA TATTCTGCAG CAATGTCTTT 5340 
AAACAGTGCC GGTAGTACAG ATAACCCCTC ATAAAGATGC TTATCTAACC TCCCCAGTGT 5400 
5 TCAGGTGTTT CACAAG AA AG TCTGAGATAT GACTAGCTAC ACGTTTTGCC A AAAATGCTT 5460 
GTTATATAAA GGGTACTTTT GGGAGGGTGA GTGCCGCCAT TTAGTGGCTG CTAGAAACAT 5520 
TGCTTCTGTT TGTAAGTTCC TATTAAATGT TCTTTCTGAG AAAAAAAAAA A 

10 SEQ ID NO:259 PBM4 Protein sequence: 
PBM4 Protein sequence: BAB67788 

MDTVMKQTHA DTPVDHCLSG KKCSSTFKL KSEVNKHETA LEMQNPNLNN KECCFTFTLN 60 
GNSRKLDRSV FTAYGKPSES IYSALSANDY FSERIKNQFN KNIIVYEEKT 1DGHINLGMP 120 

15 LKCLPSDSHF HTFGQRKSS KEDGHILRQC ENPNMECILF HVVAIGRTRK KIVKINELHE 180 

KGSKLCIYAL KGETIEGALC KDGRFRSDIG EFEWKLKEGH KKIYGKQSMV DEVSGKVLEM 240 
DISKKKALQQ KDIHKKIKQN ESATDEINHQ SLIQSKKKVH KPKKDGETKD VEHSREQILP 300 
PQDLSHYIKD KTRQTIPRIR N YYFCS LPRK YRQINSQ VRR RPHLGRRYAI NLDVQKEAM 360 
LLKNYQTLNE ABvlHQYPNFK EEAQWVRKYF REEQKRMNLS PAKQFNIYKK DFGKMTANSV 420 

20 SVATCEQLTY YSKS VGFMQW DNNGNTGNAT CFVFNGGYIF TCRHWHLMV GKNTHPSLWP 480 
DIISKCAKVT FTYTEFCPTP DKWFSffiPWL KVSNENLDYA ILKLKENGNA FPPGLWRQIS 540 
PQPSTGLIYL IGHPEGQIKK IDGCTVIPLN ERLKKYPNDC QDGLVDLYDT TSNVYCMFTQ 600 
RSFLSEVWNT HTLSYDTCFS DGSSGSPVFN ASGKLVALHT FGLFYQRGFN VHAUEFGYS 660 
MDSILCDIKKTNESLYKSLN DEKLETYDEE KARPRPAYRR LGCFRFRSRFPILGTGETGR 720 

25 IEAGKDRRGH GVSETGSCSR RQGGALWVSP AQPIGFRSSW SSGAFASSNT SGNCVERWIP 780 
GEVLARRAVS KEQQNNCSTS LMRMESRGDP RATTNTQAQR FHSPKKNPED QTMPQNRTIY 840 
VTLKAVRKEI ETHQGQEMLV RGTEGIKEYI NLGMPLSCFP EGGQVVITFS QSKSKQKEDN 900 
HIFGRQDKAS TECVKFYIHA IGIGKCKRRI VKCGKLHKKG RKLCVYAFKG ETIKDALCKD 960 
GRFLSFLEND DWKLIBNNDT ILESTQPVDE LEGRYFQVEV EKRMVPSAAA SQNPESEKRN 1020 

30 TCVLREQIVA QYPSLKRESE KIIENFKKKM KVKNGETLFE LHRTTFGKVT KNSSSIKWK 1080 
LLVRLSDSVG YLFWDS ATTG YATCFVFKGL FILTCRHVID SIVGDGIEPS KWATHGQCV 1 140 
RVTFGYEELK DKETNYFFVE PWFEIHNEEL DYAVLKLKEN GQQVPMELYN GITPVPLSGL 1200 
IHHGHPYGE KKQIDACAVI PQGQRAKKCQ ERVQSKKAES PEYVHMYTQR SFQKIVHNPD 1260 
VrrYDTEFFF GASGSPVFDS KGSLVAMHAA GFAYTYQNET RSIIEFGS1M ESILLDIKQR 1320 

3 5 HKPW YEEVFV NQQD VEMMSD EDL 

SEQ ID NO:gG0 PBQ1 DNA sequence 
Nucleic Acid Accession*: NMJM5642 
4U Coding sequence: 489-2489 (underlined sequence corresponds to start and stop codon) 

1 11 21 31 41 51 

,.11 I I I I 

45 ACATTTCAAA AAAAATACAT AGACTGATGT TTCAGACTTG TGCAGCATAA GCCTACAGGG 60 

TACGAAGAAT GAACTCTGAG AATGTTTGGA GAATGTTTCA TCATTACTAA CAGGATATTC 120 

CTCATGACAT TGCTGTCTGA TCTTTGACCA TCAGTCTGTG ACCTGCCCCT TCTCTTTACA 180 

TGCAGCCGCT CTCTGCTCCC TGCCCCAATG AACATCTGCA CTAGGCCCAA GCCTTGGAGT 240 

_ AATTTACCTG AAGAGTGACA CCATTGATTT TGAAACTACT GAAGAAACCC AAGACAGCTG 300 

50 AAAACCAGAA GGCATCTGAG GAGAATGAGA TTACTCAGCC GGGTGGATCC AGCGCCAAGC 360 

CGGGCCTTCC CTGCCTGAAC TTTGAAGCTG TTTTGTCTCC AGACCCAGCC CTCATCCACT 420 

CAACACATTC ACTGACAAAC TCTCACGCTC ACACCGGGTC ATCTGATTGT GACATCAGTT 480 

GCAAGGG GAT GACCGAGCGC ATTCACAGCA TCAACCTTCA CAACTTCAGC AATTCCGTGC 540 

TCGAGACCCT CAACGAGCAG CGCAACCGTG GCCACTTCTG TGACGTAACG GTGCGCATCC 600 

55 ACGGGAGCAT GCIGCGCGCA CACCGCTGCG TGCTGGCAGC CGGCAGCCCC TTCTTCCAGG 660 

ACAAACTGCT GCTTGGCTAC AGCGACATCG AGATCCCGTC GGTGGTGTCA GTGCAGTCAG 720 

TGCAAAAGCT CATTGACTTC ATGTACAGCG GCGTGCTACG GGTCTCGCAG TCGGAAGCTC 780 

TGCAGATCCT CACGGCCGCC AGCATCCTGC AGATCAAAAC AGTCATCGAC GAGTGCACGC 840 

. _ GCATCGTGTC ACAGAACGTG GGCGATGTGT TCCCGGGGAT CCAGGACTCG GGCCAGGACA 900 

60 CGCCGCGGGG CACTCCCGAG TCAGGCACGT CAGGCCAGAG CAGCGACACG GAGTCGGGCT 960 

ACCTGCAGAG CCACCCACAG CACAGCGTGG ACAGGATCTA CTCGGCACTC TACGCGTGCT 1020 

CCATGCAGAA TGGCAGCGGC GAGCGCTCTT TTTACAGCGG CGCAGTGGTC AGCCACCACG 1080 

AGACTGCGCT CGGCCTGCCC CGCGACCACC ACATGGAAGA CCCCAGCTGG ATCACACGCA 1140 

TCCATGAGCG CTCGCAGCAG ATGGAGCGCT ACCTGTCCAC CACCCCCGAG ACCACGCACT 1200 

65 GCCGCAAGCA GCCCCGGCCT GTGCGCATCC AGACCCTAGT GGGCAACATC CACATCAAGC 1260 

AGGAGATGGA GGACGATTAC GACTACTACG GGCAGCAAAG GGTGCAGATC CTGGAACGCA 1320 

ACGAATCCGA GGAGTGCACG GAAGACACAG ACCAGGCCGA GGGCACCGAG AGTGAGCCCA 1380 

AAGGTGAAAG CTTCGACTCG GGCGTCAGCT CCTCCATAGG CACCGAGCCT GACTCGGTGG 1440 

AGCAGCAGTT TGGGCCTGGG GCGGCGCGGG ACAGCCAGGC TGAACCCACC CAACCCGAGC 1500 

70 AGGCTGCAGA AGCCCCCGCT GAGGGTGGTC CGCAGACAAA CCAGCTAGAA ACAGGTGCTT 1560 

CCTCTCCGGA GAGAAGCAAT GAAGTGGAGA TGGACAGCAC TGTTATCACT GTCAGCAACA 1620 

GCTCCGACAA GAGCGTCCTA CAACAGCCTT C GGTCAAC AC GTCCATCGGG CAGCCATTGC 1680 

CAAGTACCCA GCTCTACTTA CGCCAGACAG AAACCCTCAC CAGCAACCTG AGGATGCCTC 1740 

_ TGACCTTGAC CAGCAACACG CAGGTCATTG GCACAGCTGG CAACACCTAC CTGCCAGCCC 1800 

75 TCTTCACTAC CCAGCCCGCG GGCAGTGGCC CCAAGCCTTT CCTCTTCAGC CTGCCACAGC 1860 

CCCTGGCAGG CCAGCAGACC CAGTTTGTGA CAGTGTCCCA GCCCGGTCTG TCGACCTTTA 1920 

CTGCACAGCT GCCAGCGCCA CAGCCCCTGG CCTCATCCGC AGGCCACAGC ACAGCCAGTG 1980 

GGCAAGGCGA AAAAAAGCCT TATGAGTGCA CTCTCTGCAA CAAGACTTTC ACCGCCAAAC 2040 

AGAACTACGT CAAGCACATG TTCGTACACA CAGGTGAGAA GCCCCACCAA TGCAGCATCT 2100 

80 GTTGGCGCTC CTTCTCCTTA AAGGATTACC TTATCAAGCA CATGGTGACA CACACAGGAG 2160 
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TGAGGGCATA CCAGTGTAGT ATCTGCAACA AGCGCTTCAC CCAGAAGAGC TCCCTCAACG 2220 

TGCACATGCG CCTCCACCGG GGAGAGAAGT CCTACGAGTG CTACATCTGC AAAAAGAAGT 2280 

TCTCTCACAA GACCCTCCTG GAGCGACACG TGGCCCTGCA CAGTGCCAGC AATGGGACCC 2340 

CCCCTGCAGG CACACCCCCA GGTGCCCGCG CTGGCCCCCC AGGCGTGGTG GCCTGCACGG 2400 

5 AGGGGACCAC TTACGTCTGC TCCGTCTGCC CAGCAAAGTT TGACCAAATC GAGCAGTTCA 2460 

ACGACCACAT GAGGATGCAT GTGTCTGACG GATAAGTAGT ATCTTTCTCT CTTTCTTATG 2520 

AACAAAACAA AACAACAACA AAAAACAAAC AAACAAAAAA GCTATGGCAC TAGAATTTAA 2580 

GAAATGTTTT GGTTTCATTT TTACTTTCTG TTTTTGTTTT TGTTTCGTTT CATTTTGTAC 2640 

f „ TACATGAAGA ACTGTTTTTT GCCTGCTGGT ACATTACATT TCCGGAGGCT TGGGTGAATA 2700 

10 ATAGTTTTCC CAGTCTCCCT CGGATGGTGG CC TTAAGGCC TGGTAGTGCT TCAAGAGGTC 2760 

CACTGGTTGG ATCTCTAGCT ACTGGCCTCT AAATACAACC CTTCTTTACA AAAAAAAAAA 2820 
AAAAAAAAA 



45 



$EQIDNO:261 PgQ1 Prptein sequence: 
PBQ1 Protein sequence: NP_056457 

MTERfflSINL HNFSNSVLET LNEQRNRGHF CDVTVRIHGS MLRAHRCVLA AGSPEFQDKL 60 
LLGYSDIEIP SVVS VQSVQK LIDFMYSGVL RVSQSEALQI LTAAS1LQ1K TVIDECTRIV 120 
SQNVGDVFPG IQDSGQDTPR GTPESGTSGQ SSDTESGYLQ SHPQHSVDRI YSALYACSMQ 180 
NGSGERSFYS GAVVSHHETA LGLPRDHHME DPSW1TRIHE RSQQMERYLS TTPETTHCRK 240 
QPRPVRIQTL VGNIHIKQEM EDDYDYYGQQ RVQILERNES EECTEDTDQA EGTESEPKGE 300 
SFDSGVSSSI GTEPDSVEQQ FGPGAARDSQ AEPTQPEQAA EAPAEGGPQT NQLETGASSP 360 
ERSNEVEMDS TV1TVSNSSD KSVLQQPSVN TSIGQPLPST QLYLRQTETL TSNLRMPLTL 420 
TSNTQVIGTA GNTYLPALFT TQPAGSGPKP FLFSLPQPIA GQQTQFVTVS QPGLSTFTAQ 480 
LPAPQPLASS AGHSTASGQG EKKPYECTLC NKTFTAKQNY VKHMFVHTGE KPHQCSICWR 540 
SFSLKDYLIK HMVTHTGVRA YQCSICNKRF TQKSSLNVHM RLHRGEKSYE CYICKKKFSH 600 
KTLLERHVAL HS ASNGTPPA GTPPGARAGP PG VVACTEGT TYVCS VCPAK FDQIEQFNDH 660 
MRMHVSDG 

SEQ ID WO: 262 PBQ6DNA sequence 
Nucleic Acid Accession*: AI654187 

Coding sequence: 1 -S12 (underlined sequence corresponds to start and stop codon) 

l 11 21 31 41 51 

I I I I I I 

ATGGTGGAAG AGGAAACAGG CATATCTTAC ATGGTGGOAG ACAAGGGACA CCCTTCTACA 60 

AACTCTACCA CTTCTGCGCC GTCGTTTCGA CCATATAAAA ACGACCTATG CGAACTGCGT 120 

CGGAAAACTC CCTCACGATG TAAAACGAAG ATCAGGAGCA GATTTGAAGA ATTACAAAGT 180 

GAATTGGTGC CAGTCAGCAT GTCAGAGACA GACCACATAG CCTCTACTTC CTCTGATAAA 240 

AATGTTGGGA AAACACCTGA ATTAAAGGAA GACTCATGCA ACTTGTTTTC TGGCAATGAA 300 

A ACACTGATAA AACTTTATGT 360 

A TCGAATTGAA GCCCAGGAAA ATT AT ATTC C AGATCATGGT 420 

GGAGGTGAGG ATTCTTGTGC CAAAACAGAC ACAGGCTCAG AAAATTCTGA ACAAATAGCT 480 

AATTTTCCTA GTGGAAATTT TGCTAAACAT ATTTCAAAAA CAAATGAAAC AGAACAGAAA 540 

GTAACACAAA TATTGGTGGA ATTAAGGTCA TCTACATTTC CAGAATCAGC TAATGAAAAG 600 

ACTTATTCAG AAAGCCCCTA TGATACAGAC TGCACCAAGA AATTTATTTC AAAAATAAAG 660 

AGCGTTTCAG CATCAGAGGA TTTGTTGGAA GAAATAGAAT CTGAGCTCTT ATCTACGGAG 720 

50 TTTGCAGAAC ATCGAGTACC AAATGGAATG AATAAGGGAG AACATGCATT AGTTCTGTTT 780 

GAAAAGTGTG TGCAAGATAA ATATTTGCAG CAGGAACATA TCATAAAAAA GGCCAGACTT 840 

GGTCTCTGTT ATTTGCCATC AAGAACCTCA ATTGACACGT TAATTCCGTT TATCCCAAAT 900 
TTATATAGATAA 

55 

$EOipHp:263PBQ6Prp|einsequenge; 
Protein Accession*: NP_050170 

MEPKEATGKE NMVTKKKNLA FLRSRLYMLE RRKTDTVVES SVSGDHSGTL RRSQSDRTEY 60 
60 NQKLQEKMTP QGECS VAETL TPEEEHHMKR MMAKREK1IK ELIQTEKDYL NDLELCVREV 120 

VQPLRNKKTD RLDVDSLFSN IESVHQISAK LLSLLEEATT DVEPAMQVIG EVFLQIKGPL 180 
EDIYKIYCYH HDEAHSILES YEKEEELKEH LSHC1QSLK 



SEQIDHO:264PBY7DNAsa 



GGGCCTACTC TGCCGCCGCC GCCGCCCGCC CGCTCCAGCC GCCGCCGCCG CCGCCACCGC 
CCTCCAGGCT CCGGGACCCG GCCCGCGCCA CCGCCCCCGT GCGCGCCCCG CCGCCGCCGC 
CTTCGCCTTC GCCTTTTGTT TCCTCCGCTC CGGCGCCCCC GCCCCGGCTC GCGCTTTGCA 
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TGAGGGGAAG GGAGGGGGCG GGCAGGTGCA GCGGCCGGGC TAGTGGGAGG GGGCGGCGGC 
CATGGAGCGG GTGAACGACG CTTCGTGCGG CCCGTCTGGC TGCTACACAT ACCAGGTGAG 
CAGACACAGC ACGGAGATGC TGCACAACCT GAACCAGCAG CGCAAAAACG GCGGGCGCTT 
CTGCGACGTG CTCTTGCGGG TAGGCGACGA GAGCTTCCCA GCGCACCGCG CCGTGCTGGC 
CGCCTGCAGC GAGTACTTTG AGTCGGTGTT CAGCGCCCAG TTGGGCGACG GCGGAGCTGC 
GGACGGGGGT CCGGCTGATG TAGGGGGCGC GACGGCAGCA CCAGGCGGCG GGGCCGGGGG 
CAGCCGGGAG CTGGAGATGC ACACTATCAG CTCCAAGGTA TTTGGGGACA TTCTGGACTT 
CGCCTACACT TCCCGCATCG TGGTGCGCTT GGAGAGCTTT CCCGAACTCA TGACGGCCGC 1080 
CAAGTTCCTC CTGATGAGGT CGGTTATCGA GATCTGCCAG GAAGTCATCA AACAGTCCAA 
CGTACAGATC CTGGTACCCC CTGCCCGCGC CGATATAATG CTCTTTCGCC CCCCTGGGAC 1200 
CTCGGACTTG GGCTTCCCTT TGGACATGAC CAACGGGGCA GCCTTGGCAG CCAACAGCAA 
TGGCATCGCC GGCAGCATGC AGCCAGAGGA GGAGGCAGCT CGGGCGGCOCG GTGCAGCCAT 
TGCAGGCCAA GCCTCTTTGC CTGTGTTACC TGGGGTGGAC CGCTTGCCCA TGGTGGCTGG 
ACCCCTATCC CCCCAACTGC TGACTTCCCC ATTCCCCAGT GTGGCATCCA GTGCCCCTCC 
CCTGACTGGC AAGCGAGGCC GGGGCCGCCC AAGGAAGGCC AACCTGCTGG ACTCAATGTT 
TGGGTCCCCA GGGGGCCTGA GGGAGGCAGG CATCCTTCCA TGCGGTCTAT GTGGTAAGGT 
GTTCACTGAT GCCAACCGGC T 
GCTGGGCTAC ATCGACCTTC C 

AGACCCCGAC GGCCCCCGAA AGAGGAGCCG GACCAGGAAG CAGGTGGCTT S 
CGGCAAGATC TTCCGTGATG TGTATCATCT TAACCGGCAC AAGCTGTCCC A 
GAAGCCCTAC TCCTGCCCTG TGTGTGGGTT GCGGTTCAAG AGAAAAGACC G 
CCATGTGCGG TCCCATGATG GGTCCGTGGG CAAGCCTTAC ATCTGCCAGA GCTGTGGGAA 
AGGCTTCTCC AGGCCTGATC ACTTGAACGG ACATATCAAG CAGGTGCACA CTTCTGAGCG 
GCCTCACAAG TGTCAGACCT GCAATGCTTC TTTTGCCACC CGAGACCGTC TGCGCTCCCA 2040 
CCTGGCCTGT CATGAAGACA AGGTGCCCTG CCAGGTGTGT GGGAAGTACT TGCGGGCAGC 2100 
ATACATGGCA GACCACCTGA AGAAGCACAG CGAGGGGCCC AGCAACTTCT GCAGTATCTG 2160 
TAACCGAGGT TTCTCCTCTG CCTCCTACTT AAAGGTCCAT GTTAAAACCC ACCACGGTGT 2220 
TCCCCTTCCC CAGGTCTCCA GGCACCAGGA GCCCATCCTG AATGGGGGAG CAGCGTTCCA 2280 
CTGCGCCAGG ACCTATGGCA ACAAAGAAGG CCAGAAATGC TCACATCAGG ATCCGATTGA 2340 
GAGCTCTGAC TCCTATGGTG ACCTCTCAGA TGCCAGCGAC CTGAAGACGC CAGAGAAGCA 2400 
GAGTCCCAAT GGCTCTTTCT CCTGCGACAT GGCAGTCCCC AAAAACAAAA TGGAGTCTGA 2460 
TGGGGAGAAG AAGTACCCAT GCCCTGAATG TGGGAGCTTC TTCCGCTCTA AGTCCTACTT 2520 
GAACAAACAC ATCCAGAAGG TGCATGTCCG GGCTCTCGGG GGCCCCCTGG GGGACCTGGG 2580 
CCCTCCCCTT GGCTCACCTT TCTCTCCTCA GCAGAACATG TCTCTCCTCG AGTCCTTTGG 2640 
GTTTCAGATT GTTCAGTCGG CATTTGCGTC ATCTTTAGTA GATCCTGAGG TTGACCAGCA 2700 
GCCCATGGGG CCTGAAGGGA AATGAGGCAG CTGCTGTGTC CCCACGGAAA CAACCATCTG 2760 
GGGACTGCTG GGAAATGCTG TGAATGCGGA GGGAAGTGAT GTTTGGGTTC TGTAGCTGAG 2820 
AGATTTTTAT TCATTTTTAA CTGCCCCCCA ACCCCACTCC AACTCCTTCT CCACCACCCA 2880 
TTCTCCCAAT GGTCTTTAGA AATAGATTTT CATCTGATAT TCTGCAGAAA TATCAATGAG 2940 
ACTTGGTATG GGACAGGGGC AGAAAACACT ACATAGGCCT CCAAGGCAAA ACCAGTCCCA 3000 
GTTTCTTTAA TGGGAAGAAG CTGGAATTCC TGGTGCTCAA T1CTTAGTGA CCCCAATCCT 3060 
ATACCCAAAT CTATGATATT CTGGGACCTC AGTGATTTTG GTCCCCTCCC ACTTCTCTAG 3120 
TTCGTCATCC TCCCTTCCCA . TATCCTTCAA AAGAACCACA CTAGGGTCTC CACCTACTTA 3180 
TACAATGCGG ATGCCCAACT GTTTTTAAGG AAGCCAGAAG CATCCCATGG ACCATGGGGT 3240 
GAGTGTCCTC CAAGAGCCCC CTGAGCTCAG CCCTCTGCCT GGAGGGCTCC AGACCTTTCT 3300 
GAGCCCTGCT TGGAGGCGAG CATTTTCACT GCTAGGACAA GCTCAGCTGT TGAGGACACC 3360 
CCCACCCCAA ATTTCAGTTC TTACGTGATT TTAACCATTC AACATGCTGT TGGGTTTTAA 3420 
TTCTCTAATT ATTATTATTA TTGTTATTAT TTTTTAGGAC CAGTTGTAGT GAATTGCTAC 3480 
TGAAAGCTAT CCCAGGTGAT ACAGAGCTCT TTGTAAACCG CAGTCACACA TTAGGGTTAG 3540 
TATTAAACTT TGTTTAGATG TACCATAATT AACTTGGCTA GTTGATTGTT TGAAGTCTAT 3600 
GGAAGAAATA GTTTTATGCA AAATTTTAAA AAATGCCAGT CTGGTCAGGG AAGTAGGGGG 3660 
TTTCAATGCT GTTGGGAACC AGGAAGGTGG GACAGCCGGC AGGTAGGGAC ATTGTGTACC 3720 
TCAGTTGTGT CACATGTGAG CAAGCCCAGG TTGACCTTGT GATGTGAATT GATCTGATCA 3780 
GACTGTATTA AAAATGTTAG TACATTACTC TA 

gEQ ID Np:265 PBY7 Protein sequence: 
Protein Accession*: NP_114439 

MERVKDASCG PSGCYTYQVS RHSTEMLHNL NQQRKNGGRF CDVLLRVGDE SFPAHRAVLA 60 
ACSEYFESVF SAQLGDGGAA DGGPADVGGA TAAPGGGAGG SRELEMHTIS SKVFGDJLDF 120 
AYTSRIVVRL ESFPELMTAA. KFLLMRSVIE ICQEVtKQSN VQJLVPPARA DLMLFRPPGT 180 
SDLGFPLDMT NGAALAANSN GiAGSMQPEE EAARAAGAAI AGQASLPVLP GVDRLPMVAG 240 
PLSPQLLTSP FPSVASSAPP LTGKRGRGRP RKANLLDSMF GSPGGLREAG ILPCGLCGKV 300 
FTDANRLRQH EAQHGVTSLQ LGY1DLPPPR LGENGLPISE DFDGPRKRSR TRKQVACEIC 360 
GKITRDVYHL NRHKLSHSGE KPYSCPVCGL RFKRKDRMS Y HVRSUDGSVG KPYICQSCGK 420 
GFSRPDHLNG HIKQVHTSER PHKCQTCNAS FATRDRLRSH LACHEDKVPC QVCGKYLRAA 480 
YMADHLKKHS EGPSNFCSIC NREGQKCSHQ DPIESSDSYG DLSDASDLKT PEKQSANGSF 540 
SCDMA VPKNK MESDGEKKYP CPECGSFFRS KSYLNKHIQK VHVRALGGPL GDLGPALGSP 600 
FSPQQNMSLL ESFGFQIVQS AFASSLVDPE VDQQPMGPEG K 

?EQ|DNQ:266PBY?PNA^ 
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TGCCGCACCC GCCGCCTCCC GCCCCCAAAC CCCATCCCCG CGGTTGAGCC ACGATGAGCG 
GCAGAGTCGG CGATCTGAGC CCCAGGCAGA AGGAGGCATT GGCCAAGTTT CGGGAGAATG 
TCCAGGATGT GCTGCCGGCC CTGCCGAATC CAGATGACTA TTTTCTCCTG CGTTGGCTCC 
GAGCCAGAAG CTTCGACCTG CAGAAGTCGG AGGCCATGCT CCGGAAGCAT GTGGAGTTCC 
GAAAGCAAAA GGACATTGAC AACATCATTA GCTGGCAGCC TCCAGAGGTG ATCCAACAGT 
ATCTGTCAGG GGGTATGTGT GGCTATGACC TGGATGGCTG CCCAGTCTGG TACGACATAA 
TTGGACCTCT GGATGCCAAG GGTCTGCTGT TCTCAGCCTC CAAACAGGAC CTGCTGAGGA 
CCAAGATGCG GGAGTGTGAG CTGCTTCTGC AAGAGTGTGC CCACCAGACC ACAAAGTTGG 
GGAGGAAGGT GGAGACCATC ACCATAATTT ATGACTGCGA GGGGCTTGGC CTCAAGCATC 
TCTGGAAGCC TGCTGTGGAG GCCTATGGAG AGTTTCTCTG CATGTTTGAG GAAAATTATC 
CCGAAACACT GAAGCGTCTT TTTGTTGTTA AAGCCCCCAA ACTGTTTCCT GTGGCCTATA 
ACCTCATCAA ACCCTTCCTG AGTGAGGACA CTCGTAAGAA GATCATGGTC CTGGGAGCAA 
ATTGGAAGGA GGTTTTACTG AAACATATCA GCCCTGACCA GGTGCCTGTG GAGTATGGGG 
GCACCATGAC TGACCCTGAT GGAAACCCCA AGTGCAAATC CAAGATCAAC TACGGGGGTG 
ACATCCCCAG GAAGTATTAT GTGCGAGACC AGGTGAAACA GCAGTATGAA CACAGCGTGC 
AGATTTCCCG TGGCTCCTCC CACCAAGTGG AGTATGAGAT CCTCTTCCCT GGCTGTGTCC 
TCAGGTGGCA GTTTATGTCA GATGGAGCGG ATGTTGGTTT TGGGATTTTC CTGAAGACCA 
AGATGGGAGA GAGGCAGCGG GCAGGGGAGA TCACAGAGGT GCTGCCCAAC CAGAGGTACA 
ACTCCCACCT GGTCCCTGAA GATGGGACCC TCACCTGCAG TGATCCTGGC ATCTATGTCC 
TGCGGTTTGA CAACACCTAC AGCTTCATTC ATGCCAAGAA GGTCAATTTC ACTGTGGAGG 
TCCTGCTTCC AGACAAAGCC T 

AATAACACCT TCTCCTATAG CAGGCCTGGC CCCCTCAGTG TCTCCCTGTC A 
CCTTGTAGCA GTCATTTTCG CACAACCCTG A 
CCTCAGGAGC TTTCATTTCA G 



1740 
1800 

TCGCAATGAG GAGTAGCAGG GTAGCTGGTT GCTAGAGTTA CGGTGGGGAT CAGAAACTCT 1860 
TCCAAACATT TTAGCACTGA GGCTGGGGTA GCTTTTGGCT TTTCCCAGGT CTCAGGAGGT 
GGCCTGAGTC AGCACACATC TTCCCACTCG GTAGACAGGC TGGCCTCTCC CTCACTTTGA 

GACTtTGGCA ACTCCTGGGC CACACGGCCT GCCTCTTTGA TTACTAATGA TTGTCAGTGA 2040 

CTCAGAGCTT CCTCGGACTT CGGGTACCCA CCCGCTGTTC TCCATGCAAA CAAAGCGCCA 2100 

GGGAAATGAC CCACAGGGAT CGCAGCTGCA GGGAGGGCCA GGGAGGTTGG GGGTGGGAGT 2160 

GAATGCTAAA AGCAGATCGT CCAGTGCCCT TTTCAGTGCT ACCGGCCTCT CACCAAGCAG 2220 

TCCTCCATGT GAGCAACCCC GAGACAAAAA TGCTAAGTGG GATCAAGAGA GCAGCACTCG 2280 

GAGAGGGTGT TTGCCAGTCT GAGTGTCCCG CGGTGCCCGC CAACOCGCTT CCTGACTGAC 2340 

CTGAGCAAGG TCTTACTAAG CAGTCCCATC TCTGTGGGAG GCATGCAACG CGTGCAGGGA 2400 

GTTCAGGTGC CGGTCGGCGT AGCCAGGCCT GGAGGCCCCC CAGGCAGGAG GCCGCCCAAA 2460 



CAGCCCTTAC CCCAATCCCA CGAGCCCCGC CAACGAACCA CAGGTGCTGG GCTTTAGAGA 2580 

ACATGGGAAG GCGGCCCCAG ACCTGGCGGG AACGCCTTTC CCTCAGAGCC AGGCCCCGGC 2640 

COCGTCTGGG AAGCTCATCT TGCGAAGCTG AGGGAGCTCA GGGCAAAGGC CAGGCTAGCG 2700 

CGGACCGGAA GGGGCCGAGG CTGCACGGGC CTCTGCCAGA ACGCTCAGGA CATCCCGGCC 2760 
TGGGTTTACA ACGCTGTTAG GAAAATTAAC CAATGAATAA AGCAACGTTC AGTGCGCA 



SEP ID N0:267 PBY9 Protein sequence; 
Protein Accession*: NPJD36S61 

MSGRVGDLSP RQKEALAKFR ENVQDVLPAL PNPDDYFLLR WLRARSEDLQ KSEAMLRKHV 60 
EFRKQKDIDN USWQPPEVl QQYLSGGMCG YDLDGCPVWY DUGPLDAKG LLFSASKQDL 120 
LRTKMRECEL LLQECAHQTT KLGRKVE1IT UYDCEGLGL KHLWKPAVEA YGEFLCMFEE 180 
__ NYPETLKRLF VVKAPKLFFV AYNL1KPFLS EDTRKKIMVL GANWKEVLLK HISPDQVPVE 240 
55 YGGTMTDPDG NPKCKSKINY GGUIPRKYYV RDQVKQQYEH SVQISRGSSH QVEYEILFPG 300 
CVLRWQFMSD G ADVGFGIFL KTKMGERQRA GEMTEVLPNQ RYNSHLVPED GTLTCSDPGI 360 
YVLRFDNTYS FIHAKKVNFT VEVLLPDKAS EEKMKQLGAG TPK 



SEQ ID NO:268 PBH8 DNA seouence 
Nucleic Acid Accession*: XM_009756 

Coding sequence: 301-1440 (underlined sequence corresponds to start and stop codon) 

1 11 21 31 41 51 

I I I I I I 

GTGGGGACAG CCGAGCCGCG CCGGGCCCCT GGACGGCGTC GCCAAGGAGC TGGGATCGCA 
CTTGCTGCAG ACTTTGGATG GATTTGTTTT TGTGGTAGCA TCTGATCGCA AAATCATGTA 
TATATCCGAG ACCGCTTCTG TCCATTTAGG OTTATCCCAG GTGGAGCTCA CGGGCAACAG 
TATTTATGAA TACATCCATC CTTCTGACCA CGATGAGATG ACCGCTGTCC TCACGGCCCA 
CCAGCCGCTG CACCACCACC TGCTCCAAGG TATGAGATAG AGAGGTCGTT CTTTCTTCGA 
ATGAAATGTG TCTTGGCGAA AAGGAACGCG GGCCTGACCT GCAGCGGATA CAAGGTCATC 
CACTGCAGTG GCTACTTGAA GATCAGGCAG TATATGCTGG ACATGTCCCT GTACGACTCC 
TGCTACCAGA TTGTGGGGCT GGTGGCCGTG GGCCAGTCGC TGCCACCCAG TGCCATCACC 
GAGATCAAGC TGTACAGTAA CATGTTCATG TTCAGGGCCA GCCTTGACCT GAAGCTGATA 
TTCCTGGATT CCAGGGTGAC CGAGGTGACG GGGTACGAGC CGCAGGACCT GATCGAGAAG 1 
ACCCTATACC ATCACGTGCA CGGCTGCGAC GTGTTCCACC TCCGCTACGC ACACCACCTC 
CTGTTGGTGA AGGGCCAGGT CACCACCAAG TACTACCGGC TGCTGTCCAA GCGGGGCGGC 
TCGGTGTGGG TGCAGAGCTA CGCCACCGTG GTGCACAACA GCCGCTCGTC CCGGCCCCAC 
TGCATCGTGA GTGTCAATTA TGTACTCACG GAGATTGAAT ACAAGGAACT TCAGCTGTCC 
CTGGAGCAGG TGTCCACTGC CAAGTCCCAG GACTCCTCGA GGACCGCCTT GTCTACCTCA 

414 
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CAAGAAACTA GSAAATTAGT GAAACCCAAA AATACCAAGA TGAAGACAAA GCTGAGAACA 960 

AACCCTTACC CCCCACAGCA ATACAGCTCG TTCCAAATGG ACAAACTGGA ATGCGGCCAG 1020 

CTCGGAAACT GGAGAGCCAG TCCCCCTGCA AGCGCTGCTG CTCCTCCAGA ACTGCAGCCC 1080 

CACTCAGAAA GCAGTGACCT TCTGTACACG CCATCCTACA GCCTGCCCTT CTCCTACCAT 1140 

A CTCTCACGTC TTCAGCAGCA AAAAGCCAAT GTTGCCGGCC 1200 

3 ATCCCCTTGT GAGGTGGCAC GCTTTTTCCT GAGCACACTG 1260 

A GTGGCATTAT GCCAACCCCC TAGTGCCTAG CAGCTCGTCT 132 0 

\CCGGCG AACACTGCTA GGCACAGCCT GGTGCCAAGC 1380 

Z CTCTGCGGAG ATACCGCCAG CTCCCCAGGA CGCAGACTGA 1440 



Protein Accession* NPJM5060 

MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SDRLTTSYL KMRAVFPEGL 60 
GDAWGQPSRA GPLDGVAKEL GSHLLQTLDG FVFVVASDGK IMYISETASV HLGLSQVELT 120 
GNSIYEYIHP SDHDEMTAVL TAHQPLHHHL LQEYEIERSF FLRMKCVLAK RNAGLTCSGY 180 
KVIHCSGYLK RQYMLDMSL YDSCYQIVGL VAVGQSLPPS AITEIKLYSN MFMFRASLDL 240 
KLDFLDSRVT EVTGYEPQDL EKTLYHHVH GCDVFHLRYA HHLLLVKGQV TTKYYRIXSK 300 
RGGWVWVQSY ATVVHNSRSS RPHCIVSVNY VLTEEYKEL QLSLEQVSTA KSQDSWRTAL 360 
STSQETRKLV KPKNTKMKTK LRTNPYPPQQ YSSFQMDKLE CGQLGNWRAS PPASAAAPPE 420 
LQPHSESSDL LYTPSYSLPF SYHYGHFPLD SHVFSSKKPM LPAKfGQPQG SPCEVARFFL 480 
STLPASGECQ WHYANPLVPS SSSPAKNPPE PPANTARHSL VPSYEAPAAA VRRFGEDTAP 540 
PSFPSCGHYR EEPALGPAKA ARQAARDGAR LALARAAPEC CAPPTPEAPG APAQLPFVLL 600 
NYHRVLARRG PLGGAAPAAS GLACAPGGPE AATG ALRLRH PSPAATSPPG APLPHYLGAS 660 
VHTNGR 



GGCACGAGGA GAAGATGTGG CTTGCTCATG CTTGACTTCT GCCATGGTTG TGAGGCCTCC 60 
CCAGCCATGT GGAACTGTTT TCAGGTGCTG GTTCCATGGC TCTTCCTGAG CCGAAAATAA 120 
GGAAACTCCA TAGACCTTGT CCACTGGAAC TCGITCCCAT CTACCCTCCA CTCTATCCAG 1 80 
GGTGATGGAT CTCTGCAGTA AGTGGAAGAG TTCTTCATGG CCCCCAAGGT TATATCCATC 240 
TAGAACTTCA GCACGTAATT TCATCTGGAA ATAGTGCCTT TGTGGATATA AGTTAGGTAA 300 
AACTGAAGAT GAGATCATAC TGGATTAGGA TGGGATCTAA ATCCAATGAA AATGTCTTCA 360 
TAAAAAACAG GAAAGAACCC ATAGAAACAC AAGGAAGAAG GTCATGTGAA GATGGAGGCA 420 
GAGATTGGAG GGATGCAGCC ACCGGCCCAG GAATGCCAGC AGCCACCCAG AAGCTGGAAG 480 
GAAATGAGGG ATTCTCTCCT AGAACCTTTA GAGAGRACAT GGTCCTGTGA ACAGCTTGAT 540 
TTTGGACTTG CCCATAGCTT GTATACTCTT ACTTTGG ATA CAATTTTATC CAAACTTGGC 600 
TAAACAGTTT CTCAGCCTAT GGAAAATTTA AAATGGAGAA GATTCAACTC GATTCTTACA 660 
GATTCAAAGC AAGAAAATGA TGGGAACATA GGAGGAGACC AAGAAAGCCT ATAAAAAGCA 720 
AAAATATGAA GTGAACATTG TGGTAGCTTT AAGATGTTTA GTGTAGCTGC AGGCACCCTA 780 
TACACATGAA AACCCCCAAG GGGAATCCCC ATATCACAGT GTAGTGTGAT ATTTGACATT 840 
YGTGATCATY TAGAGATGTA CAGAAA AGGT GAATCTGTGT TCTGTATATT CTGCCTAAGG 900 
CAAAGAAATG TTTAGCTYTC TTTAAAATAG TTCCATAATT TTTTYTAAAA AGCTTTGCTT 960 
GAAAACTGTA AGCTTCCCAT ATCTGGAGCA TTTCACTTTA AATATTTGGA TAAATATGTT 1020 
ATCTTCTTAC TTGGACATTT CATGTGTTTA GGGATTGTYT TYTAAATTCT TCCTAATTCA 1080 
TATAGCTGCT AACACTTCCC GCAGAGCTAA ACCATTACAG ANTATG AAAT AAAGACCCTA 1 140 
TTGATTTGAA CTTAAAAAAA AAAAMAMAAA AAAAAAAAAA AAAAAAAAAT GA 

SEQjEnaaa pwpna w 



1 11 21 31 41 51 

I I I I I I 

ATGGAATCAA TCTCTATCAT GGGAAGCCCT AAGAGCCTTA GTGAAACTTG TTTACCTAAT 
GGCATAAATG GTATCAAAGA TGCAAGGAAG GTCACTGTAG GTGTGATTGG AAGTGGAGAT 
TTTGCCAAAT CCTTGACCAT TCGACTTATT AGATGCGGCT ATCATGTGGT CATAGGAAGT 
AGAAATCCTA AGTTTGCTTC TGAATTTTTT CCTCATGTGG TAGATGTCAC TCATCATGAA 
GATGCTCTCA CAAAAACAAA TATAATATTT GTTGCTATAC ACAGAGAACA TTATACCTCC 
CTGTGGGACC TCAGACATCT GCTTGTGGGT A 



TTGATTGTCA AAGGATTTAA TGTTGTCTCA GCTTGGGCAC TTCAGTTAGG ACCTAAGGAT 480 

GCCAGCCGGC AGGTTTATAT ATGCAGCAAC AATATTCAAG CGCGACAACA GGTTATTGAA 540 

CTTGCCCGCC AGTTGAATTT CATTCCCATT GACTTGGGAT CCTTATCATC AGCCAGAGAG 600 

ATTGAAAATT TACCCCTACG ACTCTTTACT CTCTGGAGAG GGCCAGTGGT GGTAGCTATA 660 

AGCTTGGCCA CATTTTTTTT CCTTTATTCC TTTGTCAGAG ATGTGATTCA TCCATATGCT 720 

AGAAACCAAC AGAGTGACTT TTACAAAATT CCTATAGAGA TTGTGAATAA AACCTTACCT 780 

ATAGTTGCCA TTACTTTGCT CTCCCTAGTA TACCTCGCAG GTCTTCTGGC AGCTGCTTAT 840 

CAACTTTATT ACGGCACCAA GTATAGGAGA TTTCCACCTT GGTTGGAAAC CTGGTTACAG 900 

TGTAGAAAAC AGCTTGGATT ACTAAGTTTT TTGTTCGCTA TGGTCCATGT TGCCTACAGC 960 

CTCTGCTTAC CGATGAGAAG GTCAGAGAGA TATTTGTTTC TCAACATGGC TTATCAGCAG 1020 

GTTCATGCAA ATATTGAAAA CTCTTGGAAT GAGGAAGAAG TTTGGAGAAT TGAAATGTAT 1080 

ATCTCCTTTG GCATAATGAG CCTTGGCTTA CTTTCCCTCC TGGCAGTCAC TTCTATCCCT 1140 

TCAGTGAGCA ATGCTTTAAA CTGGAGAGAA TTCAGTTTTA TTCAGTCTAC ACTTGGATAT 1200 



415 
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C TTTCCATGTT TTAATTTATG GATGGAAACG AGCTTTTGAG 
GAAGAGTACT ACAGATTTTA TACACCACCA AACTTTGTTC TTGCTCTTGT TTTGCCCTCA 
ATTGTAATTC T 



60 



1 11 21 31 41 51 

I I I I I I 

MESISMMGSP KSLSETCLPN GINGIKDARK VTVGVIGSGD FAKSLTIRLI RCGYHWIGS 
RNPKFASEFF PHWDVTHHE DALTKTNIIF VAIHREHYTS LWDLRHLLVG KILIDVSNNM 
RINQYPESNA EYLASLFPDS LIVKGFNWS AWALQLGPKD ASRQVYICSN NIQARQOVIE 
LARQLNFIPI DLGSLSSARE lEtttPLRLFT LWRGFWVAI SLATFFFLYS FVRDVIHPYA I 
RNQQSDFYKI PIEXVNKTLP IVAITLLSLV YLAGLLAAAY QLYYGTKYRR FPPWLETWLQ I 
CRKQLGLLSF FFAMVHVAYS LCLPMRRSER YLFLNMAYQQ VHANIENSWN EEEVWRIEMY 
ISFGIMSLGL LSLLAVTSIP SVSNALNWRE FSFIQSTLGY VALLISTFHV liIYGWKRAFE 
EEYYP.FYTPP NFVLALVLPS IV1LDLLQLC RYPD 

SEQ ID NO:273 PBQ5 DNA SEQUENCE 

Nxleic Acid Accession*: NMJM1973 

Coding sequence: 150-1445 (underlined sequence corresponds to start and stop codon) 

1 11 21 31 41 51 

I I I I I I 

CCGCCGCCTT CTACTCCGCC GCGGGGGTCG CAGCGGCTGC CGCGCCGTCC TCGAGTTTCC 

AGCGTGAGGA GGAGGCTGAG GGCGGAGAGG CGCATCGTGT TCGAGGCGGA GACCGAGGGG : 

GAGCCCCGCG CGCGGCGTCG CTCATTGC TA TG GACAGTGC TATCACCCTG TGGCAGTTCC : 

TTCTTCAGCT CCTGCAGAAG CCTCAGAACA AGCACATGAT CTGTTGGACC TCTAATGATG ' 

GGCAGTTTAA GCTTTTGCAG GCAGAAGAGG TGGCTCGTCT CTGGGGGATT CGCAAGAACA • 

AGCCTAACAT GAATTATGAC AAACTCAGCC GAGCCCTCAG ATACTATTAT GTAAAGAATA " 

TCATCAAAAA AGTGAATGGT CAGAAGTTTG TGTACAAGTT TGTCTCTTAT CCAGAGATTT < 

A TCCAATGACA GTGGGCAGGA TTGAGGGTGA CTGTGAAAGT TTAAACTTCA < 



CAGCCGAGAA ACTGGCAGAG AAAAAATCTC CTCAGGAGCC CACACCATCT GTCATCAAAT 
TTGTCACGAC ACCTTCCAAA AAGCCACCAG TTGAACCTGT TGCTGCCACC ATTTCAATTG 
GCCCAAGTAT TTCTCCATCT TCAGAAGAAA CTATCCAAGC TTTGGAGACA TTGGTTTCCC 
CAAAACTGCC TTCCCTGGAA GCCCCAACCT CTGCCTCTAA CGTAATGACT GCTTTTGCCA 
CCACACCACC CATTTCGTCC ATACCCCC1T TGCAGGAACC TCCCAGAACA CCTTCACCAC 
CACTGAGTTC TCACCCAGAC ATCGACACAG ACATTGATTC AGTGGCTTCT CAGCCAATGG 
AAC TTCCAGA GAATTTGTCT CTGGAGCCTA AAGACCAGGA TTCAGTCTTG CTAGAAAAGG 
ACAAAGTAAA TAATTCATCA AGATCCAAGA AACCCAAAGG GTTAGGACTG GCACCCACCC 
TTGTGATCAC GAGCAGTGAT CCAAGCCCAC TGGGAATACT GAGCCCATCT CTCCCTACAG 
CTTCTCTTAC ACCAGCATTT TTTTCACAGA CACCCATCAT ACTGACTCCA AGCCCCTTGC 
TCTCCAGTAT CCACTTCTGG AGTACTCTCA GTCCTCTTGC 1CCCCTAAGT CCAGCCAGAC 
TGCAAGGTGC TAACACACTT TTCCAGTTTC CTTCTGTACT GAACAGTCAT GGGCCATTCA 
CTCTGTCTGG GCTGGATGGA CCTTCCACCC CTGGCCCATT TTCCCCAGAC CTACAGAAGA 
CATAACCTAT GCACTTGTGG AATGAGAGAA CCGAGGAACG AAGAAACAGA CATTCAACAT 
GATTGCATTT GAAGTGAGCA ATTGATAGTT CTACAATGCT GATAATAGAC TATTGTGATT 
TTTGCCATTC CCCATTGAAA ACATCTTTTT AGGATTCTCT TTGAATAGGA CTCAAGTTGG 
ACTATATGTA TAAAAATGCC TTAATTGGAG TCTAAACTCC ACCTCCCTCT GTCTTTTCCT 
TTTCTTTTTC TTTCCTTCCT TCCTTTTCTT TTCTCCTTTA AAAATATTTT GAGCTTTGTG 
CTGAAGAAGT TTTTGGTGGG CTTTAGTGAC TGTGCTTTGC AAAAGCAATT AAGAACAAAG 
TTACTCCTTC TGGCTATTGG GACCCTTTGG CCAGGAAAAA TTATGCTTAG AATCTATTAT 
TTAAAGAAGT ATTTGTGAAA TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



SEQ ID NQ:274PBQ5 Protein sequence: 
Protein Accession*: NP_001964 

MDSA.ITLWQF LLQLLQKPQN KHMICWTSND GQFKLLQAEE V ARLWGERKN KPNMNYDKLS 60 
RALRYYYVKN IIKKVNGQKF VYKFVSYPEI LNMDPMTVGR IEGDCESLNF SEVSSSSKDV 120 
ENGGKDKPPQ PGAKTSSRND YIHSGLYSSF TLNSLNSSNV KLFKLKTEN PAEKLAEKKS 180 
PQEPTPSVIK FVTTPSKKPP VEPVAATISI GPSISPSSEE TIQALETLVS PKLPSLEAPT 240 
SASNVMTAFA TTPPISSIPP LQEPPRTPSP PLSSHPDIDT DIDSVASQPM ELPENLSLEP 300 
KDQDSVLLEK DKVNNSSRSK KPKGLGLAPT LVITSSDPSP LGILSPSLPT ASLTPAFFSQ 360 
TPIILTPSPL LSSIHFWSTL SPVAPLSPAR LQGANTLFQF PSVLNSHGPF TLSGLDGPST 420 
PGPFSPDLQK T 

SEQ ID NO:275 PBY3 DNA SEQUENCE 

Nucleic Acid Accession*: AB040921 

Coding sequence: 131-2560 (underlined sequence corresponds to start and stop codon) 
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AATCAGGAAC AGATCATATA TTGACCGAGA TTCTGAGTAT CTCTTGCAAG A 
GTATATTGAA ATGCAGCATT TCAGAGAAAA GCTGCCTTCG TATGGAATGC AAAAGGAATT 
AACCACTCAA GTTACTCAGT TCATTTTGGA TAACTACATT GAAAGAGGAA AAGGATCTGC 



AGCTGCAGAA AGGGCAGAAT CTTGTGGCAG TGGTAATAGT ACTGGATATC AAATTCGTCT 
CCAGAGTCGG TTGCCAAGGA AACAGGGTTC TATCTTATAC TGTACAACAG GAATCATCCT 
TCAGTGGCTC CAGTCAGACC CGTATTTGTC CAGTGTTAGT CATATCGTAC TTGATGAAAT 
CCATGAAAGA AATCTGCAGT CAGATCTTTT AATGACTGTT GTTAAAGACC T 



AACACAGATC 780 



AGCAATATAT AAAGAACGTT GGCCAGATTA TGTAAGGGAA CTGCGAAGAA GGTATTCTGC 900 
AAGTACTGTA GATGTTATAG AAATGATGGA GGATGATAAA G 
TGCCCTCATC CGATACATTG T 
AGGCIGGGAC AATATCAGCA CTTTACATGA TCTCTTGATG TCACAAGTAA TGTTTAAATC 1080 
AGATAAATTT TTAATTATAC CTTTACATTC ACTGATGCCT ACAGTTAACC AGACACAGGT 1140 
GTTTAAAAGA ACCCCTCCTG GTGTTCGGAA AATAGTAATT GCTACCAACA TTGCGGAGAC 1200 
TAGCATTACC ATAGATGATG TCGTTTATGT GATAGATGGA GGAAAAATAA AAGAGACGCA 1260 
TTTTGATACT CAGAACAATA TCAGTACAAT GTCCGCTGAG TGGGTTAGTA AAGCTAATGC 1320 
CAAACAGAGA AAAGGTCGAG CTGGAAGAGT TCAACCTGGT CATTGCTATC ATCTGTATAA 1380 
TGGTCTTAGA GCAAGTCTTC TAGATGACTA TCAACTGCCA GAAATTTTGA GAACTCCTTT 
GGAAGAACTT TGTTTACAAA TAAAGATTTT AAGGCTAGGT GGAATTGCTT ATTTTCTGAG 
TAGATTAATG GACCCACCAT CAAATGAGGC AGTGTTACTC TCCATAAGAC ACCTGATGGA 
GCTGAACGCT TTGGATAAAC AAGAAGAATT GACACCTCTT GGAGTCCACT TGGCACGATT 
ACCCGTTGAG CCACATATTG GAAAAATGAT TCTTTTTGGA GCACTGTTCT GCTGCTTAGA 
CCCAGTACTC ACTATTGCTG CTAGTCTCAG TTTCAAAGAT CCATTTGTCA TTCCACTGGG 
AAAAGAAAAG ATTGCAGATG CAAGAAGAAA GGAATTGGCA AAGGATACTA GAAGTGATCA 
CTTAACAGTT GTGAATGCGT TTGAGGGCTG GGAAGAGGCT AGGCGACGTG GTTTCAGATA 
CGAAAAGGAC TATTGCTGGG AATATTTTCT GTCTTCAAAC ACACTGCAGA TGCTGCATAA 
CATGAAAGGA CAGTTTGCTG AGCATCTTCT TGGAGCTGGA TTTGTAAGCA GTAGAAATCC 



TGCTGGTTTA TATCCCAAAG TTGCTAAAAT 



GGAGCAAACA GACTTTCACT ACAACTGGCT TATCTATCAC CTAAAGATGA GAACAAGCAG 2220 

T TTGGAGGTGA 2280 

CTGTATT 2340 

T TAAGGAATTA AGAAAGGAAC TAGATATTCT 2400 



CTGTGCAGTA CTGTCAGCTA T 
GAACTTTCCG CCACGATTCC A 
GAAAAGCCAG TTTGACAGCC A 

CTGGGACATG AACAATTTTC ATGTCTAAGG TAGAAGCCTT CAGTAGGTAG TAAAGACTTA 
ATGTGCATGA CTTGATGTTA TATGTAGAGA TATATATATA TATATATATA CCATAAAAGC 2760 
AATATGTTCT CTGATCATAT ACTCTGCTGT GGTCATGCCC ACTCTTTGGG AGTATATTCC 2820 
CTTTATATAT ATTGAGTATT GTACCACTTG AGAAATTCCT TTGTTCTGTT ATACAAAATT 2880 
AATCTTTCTG CTCATAATGA TTGATGATAC CACCAGTAAA AATAGGATGT TTACCCCAAA 2940 
ACAAGTGTCA ATTAAGAATT TGAACACAAC CACATTTTTT AAAATGAAAC TTCTATCGGA 3000 
AGTAAATTAA TTTGTTGTAA TAAAGTCCAG TATTTAATAA AATGTACAAT GTTAAATCTC 



KNRSYIDRD SEYLLQENEP DGTLDQKLLE DLQKKKNDLR YIEMQHFREK LPSYGMQKEL 60 
VNUDNHQVT V1SGETGCGK TTQVTQFILD NY1ERGKGSA CRIVCTQPRR ISAISVAERV 120 
AAERAESCGS GNSTGYQIRL QSRLPRKQOS ILYCTTGIIL QWLQSDPYLS SVSHIVLDEI 180 
HERNLQSDVL MTWKDLLNF RSDLKVILMS ATLNAEKFSE YFGNCPMIHI PGFTFPVVEY 240 
LLEDVIEKIR YVPEQKEHRS QFKRGFMQGH VNRQEKEEKE AIYKERWPDY VRELRERYS A 300 
STVD VIEMME DDKVDLNLIV ALIRYTVLEE EDGAILVFLP GWDNISTLHD LLMSQVMFKS 360 
DKFLIIPLHS LMPTVNQTQV FKRTPPGVRK IVIATNIAET SITIDDVVYV IDGGKIKETH 420 
FDTQNN1STM SAEWVSKANA KQRKGRAGRV QPGHCYHLYN GLRAS LLDD Y QLPEILRTPL 480 
EELCLQIKIL RLGGIAYFLS RLMDPPSNEA VLLSIRHLME LNALDKQEEL TPLGVHLARL 540 
PVEPfflGKMl LFGALFCCLD PVLTIAASLS FKDPFVIPLG KEKIADARRK ELAKDTRSDH 600 
LTVVNAFEGW EEARRRGFRY EKDYCWEYFL SSNTLQMLHN MKGQFAEHLL GAGFVSSRNP 660 
KDPESNINSD NEKIIKAVIC AGLYPKVAKI RLNLGKKRKM VKVYTKTDGL VAVHPKSVNV 720 
EQTDFHYNWL IYHLKMRTSS IYLYDCTEVS PYCLLFFGGD ISIQKDNDQE TIAVBEWIVF 780 
QSPARIAHLV KELRKELDIL LQEKESPHP VDWNDTKSRD CAVLSAIIDL IKTQEKATPR 840 
NFPPRFQDGYYS 



SEQ ID NO:277 PBY6 DNA SEQUENCE 
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CTGATGACAT ACTTCATCCA GCTGGGCTTT GTCGAGAGTC GATTCTTCCC GCCCACACGG 180 
CAGATGGGAC TCCTGTTCAC CTGGTATGAC TCTCTCACCG GGGTTCCGGT CAGCCAGCAG 240 
AACCTGCTGC TGGAGAAGGC CAGTGTCCTG TTCAACACTG GGGCCCTCTA CACCCAGATT 300 
GGGACCCGGT GTGATCGGCA GACGCAGGCT GGGCTGGAGA GTGCCATAGA TGCCTTTCAG 360 
AGAGCCGCAG GGGTTTTAAA TTACCTGAAA GACACATTTA CCCATACTCC AAGTTACGAC 420 
ATGAGCCCTG CCATGCTCAG CGTGCTCGTC AAAATGATGC TTGCACAAGC CCAAGAAAGC 480 
GTGTTTGAGA AAATCAGCCT TCCTGGGATC CGGAATGAAT TCTTCATGCT GGTGAAGGTG 540 
GCTCAGGAGG CTGCTAAGGT GGGAGAGGTC TACCAACAGC TACACGCAGC CATGAGCCAG 600 
GCGCCGGTGA AAGAGAACAT CCCCTACTCC TGGGCCAGCT TAGCCTGCGT GAAGGCCCAC 660 
CACTACGCGG CCCTGGCCCA CTACTTCACT GCCATCCTCC TCATCGACCA CCAGGTGAAG 720 
CCAGGCACGG ATCTGGACCA CCAGGAGAAG TGCCTGTCCC AGCTCTACGA CCACATGCCA 780 
GAGGGGCTGA CACCCTTGGC CACACTGAAG AATGATCAGC AGCGCCGACA GCTGGGGAAG 840 
TCCCACTTGC GCAGAGCCAT GGCTCATCAC GAGGAGTCGG TGCGGGAGGC CAGCCTCTGC 900 
AAGAAGCTGC GGAGCATTGA GGTGCTACAG AAGGTGCTGT GTGCCGCACA GGAACGCTCC 960 
CGGCTCACGT ACGCCCAGCA CCAGGAGGAG GATGACCTGC TGAACCTGAT CGACGCCCCC 1020 
AGTGTTGTTG CTAAAACTGA GCAAGAGGTT GACATTATAT TGCCCCAGTT CTCCAAGCTG 1080 
ACAGTCACGG ACTTCTTCCA GAAGCTGGGC CCCTTATCTG TGTTTTCGGC TAACAAGCGG 1 140 
TGGACGCCTC CTCGAAGCAT CCGCTTCACT GCAGAAGAAG GGGACTTGGG GTTCACCTTG 1200 
AGAGGGAACG CCCCCGTTCA GGTTCACTTC CTGGATCCTT ACTGCTCTGC CTCGGTGGCA 1260 
GGAGCCCGGG AAGGAGATTA TATTGTCTCC ATTCAGCTTG TGGATTGTAA GTGGCTGACG 1320 
CTGAGTGAGG TTATGAAGCT GCTGAAGAGC TTTGGCGAGG ACGAGATCGA GATGAAAGTC 1380 
GTGAGCCTCC TGGACTCCAC ATCATCCATG CATAATAAGA GTGCCACATA CTCCGTGGGA 1440 
ATGCAGAAAA CGTACTCCAT GATCTGCTTA GCCATTGATG ATGACGACAA AACTGATAAA 1500 
ACCAAGAAAA TCTCCAAGAA GCTTTCCTTC CTGAGTTGGG GCACCAACAA GAACAGACAG 1560 
AAGTCAGCCA GCACCTTGTG CCTCCCATCG OTCGGGGCTG CACGGCCTCA GGTCAAGAAG 1620 
AAGCTGCCCT CCCCITTCAG CCTTCTCAAC TCAGACAGTT CTTGGTACTAA 

SEQIDNO:278P BY6Pfoleinseouence: 



DFILEHYSED GYLYEDEIAD LMDLRQACRT PSRDEAGVEL LMTYHQLGF VESRFFPPTR 60 
QMGLLFTWYD SLTGVPVSQQ NLLLEKASVL FNTGALYTQI GTRCDRQTQA GLESA1DAFQ 1 20 
RAAGVLNYLK DTFTHTPSYD MSPAMLSVLV KMMLAQAQES VFEKISLPGI RNBFFMLVKV 180 
AQEAAKVGEV YQQLHAAMSQ APVKENIPYS WASLACVKAH HYAALAHYFT AILL1DHQVK 240 
PGTDLDHQEK CLSQLYDHMP EGLTPLATLK NDQQRRQLGK SHLRRAMAHH EESVREASLC 300 
KKLRStEVLQ KVLCAAQERS RLTYAQHQEE DDLLNLIDAP SVVAKTEQEV DDLPQFSKL 360 
TVTDFFQKLG PLS VFS ANKR WTPPRSIRFT AEEGDLGFTL RGNAPVQVHF LDPYCS AS VA 420 
GAREGDYIVS IQLVDCKWLT LSEVMKLLKS FGEDEIEMKV VSLLDSTSSM HNKS ATYSVG 480 
MQKTYSMICL AIDDDDKTDK TKKISKKLSF LSWGTNKNRQ KSASTLCLPS VGAARPQVKK 540 
KLPSPFSLLN SDSSWY 

SEQ ID N0:279 PBY8 DNA SEQUENCE 

Nucleic Acid Accession*: AF107493 

Coding sequence: 125-556 (underlined sequence corresponds lo start and stop codon) 

1 ll 21 31 41 51 

I I I I I 

CGAGCCTTGT TGGAGGTTCT GGGGCGCAGA ACCGCTACTG CTGCTTCGGT 60 

•CCTTGG GAAAAAATAA AATTTGAACC TTTTGGAGCT GTGTGCTAAA TCTTCAGTGG 120 

TCAGACAAAA GAGTGAGTAG AACAGAGCGT AGTGGAAGAT ACGGTTCCAT 180 

GATGACCGTG ATGAGCGTGA ATCCCGAAGC AGGCGGAGGG ACTCAGATTA 240 



360 

r GGTGAGCACG ACTATAGGCA TGACATCAGT GACGAGAGGG AGAGCAAGAC 420 

C CCATCACCAT CACAGAGAGC GATATTCGAG AAATGATGGA 480 

GTCCTTCGAA GGCCCTCAGC CTGCGGATGT GAGGCTGATG AAGAGGAAAA CAGGTGAGAG 540 

CTTGCTTAGT TCCTGATATT ATTGTTCTCT TCCCCATTCC CACCTCAGTC CCTAAAGAAC 600 

ATCCTGATTC CCCCAGTCTT CAAGCACATC AATTCAGAAT GAAAGGTTTG CCATGGCTAA 660 

GGAATGTGAC TCTTTGAAAA CCATGTTAGC ATCTGAGGAA CTTTTTTAAA CTTTGTTTrA 720 

GGGACTTTTT TTTCCTTAGG TAAGTAATGA TTTATAAACT CCTTTTTTTT TTTGACTATA 780 

GTCGGTTGCA TGGTTACTTT AAGCGTGGAA TCAAATGGAG TGGCATTTAG TTCAGGCGGC 840 

TTGTTCCTTG CCATGGCAAA GTATCAAGAA GATCCCCAAG TCAAGTCACA TTTGTAAAGC 900 

TGCTTCCCAA TTGGCTTTGT CACGCAGTGT TGAAGCAGTG GGAGAGAGAT TCACCTGTTA 960 

TAAAGGAACT GACTAACACA AGTATCCCGT CTATATCTGA ATGCTGTCTC TAGGTGTAAG 1020 

CCGTGGTTTC GCCTTCGTGG AGTTTTATCA CTTGCAAGAT GCTACCAGCT GGATGGAAGC 1080 

CAATCAGGTT GCTTCACTCA CCAAGTCTAG ATATTCATGA AAATGGAACA AGTCTGTACA 1140 

ATTTTAAAAA AAGGTK3AAG GAGTGGTTTG TTCCAAAGGA GTGACTTTTT TTTAAAAAAA 1200 

AAGCTTTGTA TATATTAAAA TTGATGTTAC TAGAATAAGT ACAGTACCAA GGACTTCATT 1260 

ATAGAATTTG TTCTGCCTTT AAACATGGCT ACCTACCTGG CAGGGCTTO3 TTAACTACTG 1320 

AATACCTGTC TGGTAATCAC TAAAACATCT TTATGTTTCC CTTTTTTCTA GTTTGTTATA 1380 

TTCCTATTAT GTCCATTGAG AGTAAGCTTA GTATATCAAA CTCTCCATTT GACAGTGAAG 1440 

AGAACATAGT GAAAGTCTGT GGCGGCATTT TTATAAGTAA TTCCTTATTT CTGCCTGAAG 1500 

AG CACAAAGC CTCCTGGAGG CGTAACTGCT CAGACCGGTC TTCAGGGAAT ATTTAAGGAC 1560 

TTAGTGGAAT TTATGAACAA TAAGTCTGAT GAGATTAGCC TGGSAGTCGT GTCCTGCAGC 1620 

TGTCTAATCT AGAGTGGCAT TAACATTCTA ATCTCCTTGA GAATGCCTTT TATAGTCTGT 1680 

TCAAAGCAAG TCATTGATGG TTCTTCGAGG TAGTGTTAAC TGAAGTGTTC TTCAGTTTGT 1740 

CAAGATAATG TTCAGTGCTT GGCACTTAAA TAACATTTTT TGCAAGAACT CCAAGGCACA 1800 



418 
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60 



TTATTGAATG CCTTTAACCA AGTGCATTCT GGGAAGTTTG CTTGACTCAT TATCTTGCTT 1860 

TTCTGCAGCA TTCTGTGATT TGAGTCATCC ATGAATCCAT GAATAAAAGT TACATTCTTT 1920 

GATTGGTAAT ATTGCCATTT ATAACAAGAC TCACTAATGA GGGTATCACT TTGACTGACT 1980 

GATTTGTTAA AGTTTTTAAG CCTCTCATTT TCCTAACCCA GAAATCACAG CCTGATTTTA 2040 

TTAAAAGTAG AGCTTCATTC ATTTCATACC ATAGATACCA TCCTAGTAAA TCCAGAACAT 2100 

ATACAAGGTT CATGTGAGTC TGCTTTCTTG ACATGATAGC ATTGTTTGAT GCAGTGGATA 2160 

TGTCAGAATG ACTAACCTAG GAGTTTGAAA CTCCTAAGAA ACTAAAACCT GTAAGACATT 2220 

TAAAAGTCTC CACAATTTTA ATGTATACAA AGCTATGTTA CTGTGTAACA CATTACAGTT 2280 

CAAATTCACT CCAGAAATAA AAGGCCAGTA GGATTAGGGA CTCACTGGTA GTTTGGAGTO 2340 

TCCCAGCACA CATCCCTCCT AGTGGGATGA TCTATTCACA TATCTCCCAG CTTTTTTATT 2400 

TTTGCTTCTG TATATCACAG TGAGTGGATG GCCCTTCAGC TTTTTCTCTC CTGGCCAGAC 2460 

ATGCAGTCTT GCCTTTAGAT ATCGCAGAGA CAAAATTCAC AGCATGTCTT AAATCTTCCA 2520 

GGATTTGCAA GAACCAAATT GCTCAACAGT ATGTATGTTT AGAGGGGTTA GACTCCTTTT 2580 

TAAAATCTGG ATATCTAACC ACCTACTTAA ATCTGTTTGA TAGTGTCAAA CCACCCCCAC 2640 
CCTTGATCCT CCCACCCCCA A 



MGSDKRVSRT ERSGRYGSII DRDDRDERES RSRRRDSDYK RSSDDRRGDR YDDYRDYDSP 60 

ERERERRNSD RSEDGYHSDG DYGEHDYRHD ISDERBSKTI MLRGLPITIT ESDKEMMES 120 
FEGPQPADVR LMKRKTGESL LSS 

25 

SEQ ID NO:281 PCI2 DNA SEQUENCE 

Nucleic Acid Accession*: AF208291 

Coding sequence; 109-3705 (underlined sequence corresponds to start and stop codon) 

30 1 11 21 31 41 si 

I I I I I I 

CGGCCGCTTT TTTCTCAAGA TGGCAGATTC CCACTGAGGC TGAGGGGGCC GAGCICGCGC 60 

GCCGCGTTCC CTTCTCCGTT GCCATGAACC GCGGACACCC CGGCCCCG AT G GCCCCCGTG 120 

TACGAAGGTA TGGCCTCACA TGTGCAAGTT TTCTCCCCTC ACACCCTTCA ATCAAGTGCC 180 

35 TTCTGTAGTG TGAAGAAACT AAAAGTAGAG CCAAGTTCCA ACTGGGACAT GACTGGGTAC 240 

GGCTCCCACA GCAAAGTGTA CAGCCAGAGC AAGAACATAC CACCTTCTCA GCCAGCCTCC 300 

ACAACCGTCA GCACCTCCTT GCCGGTCCCA AACCCAAGCC TACCTTACGA GCAGACCATC 360 

GTCTTCCCAG GAAGCACCGG GCACATCGTG GXCACCTCAG CAAGCAGCAC TTCTGTCACC 420 

GGGCAAGTCC TCGGCGGACC ACACAACCTA ATGCGTCGAA GCACTGTGAG CCTCCTTGAT 480 

40 ACCTACCAAA AATGTGGACT CAAGCGTAAG AGCGAGGAGA TCGAGAACAC AAGCAGCGTG 540 

CAGATCATCG AGGAGCATCC ACCCATGATT CAGAATAATG CAAGCGGGGC CACTGTCGCC 600 
ACTGCCACCA CGTCTACTGC CACCTCCAAA AACAGCGGCT C 
CAGCTGGTGC AGCATGAGGT G 

_ TTGGGCCGAG GGACGTTTGG ACAAGTGGTC AAGTGCTGGA AACGGGGCAC CAATGAGATC 780 

45 GTAGCCATCA AGATCCTGAA GAACCGCCCA TCCTATGCCC GACAAGGTCA GATTGAAGTG 840 

AGCATCCTGG CCCGGTTGAG CACGGAGAGT GCCGATGACT ATAACTTCGT CCGGGCCTAC 900 

GAATGCTTCC AGCACAAGAA CCACACGTGC TTGGTCTTCG AGATGTTGGA GCAGAACCTC 960 

TATGACTTTC TGAAGCAAAA CAAGTTTAGC CCCTTGCCCC TCAAATACAT TCGCCCAGTT 1020 

CTCCAGCAGG TAGCCACAGC CCTGATGAAA CTCAAAAGCC TAGGTCTTAT CCACGCTGAC 1080 

50 CTCAAACCAG AAAACATCAT GCTGGTGGAT CCATCTAGAC AACCATACAG AGTCAAGGTC 1140 

ATCGACTTTG GTTCAGCCAG CCACGTCTCC AAGGCTGTGT GCTCCACCTA CTTGCAGTCC 1200 

AGATATTACA GGGCCCCTGA GATCATCCTT GGTTTACCAT TTTGTGAGGC AATTGACATG 1250 

TGGTCCCTGG GCTGTGTTAT TGCAGAATTG TTCCTGGGTT GGCCGTTATA TCCAGGAGCT 1320 

TCGGAGTATG ATCAGATTCG GTATATTTCA CAAACACAGG GTTTGCCTGC TGAATATTTA 1380 

55 TTAAGCGCCG GGACAAAGAC AACTAGGTTT TTCAACCGTG ACACGGACTC ACCATATCCT 1440 

TTGTGGAGAC TGAAGACACC AGATGACCAT GAAGCAGAGA CAGGGATTAA GTCAAAAGAA 1500 

GCAAGAAAGT ACATTTTCAA CTGTTTAGAT GATATGGCCC AGGTGAACAT GACGACAGAT 1560 

TTGGAAGGGA GCGACATGTT GGTAGAAAAG GCTGACCGGC GGGAGTTCAT TGACCTGTTG 1620 

AAGAAGATGC TGACCATTGA TGCTGACAAG AGAATCACTC CAATCGAAAC CCTGAACCAT 1680 

CCCTTTGTCA CCATGACACA CTTACTCGAT TTTCCCCACA GCACACACGT CAAATCATGT 1740 

TTCCAGAACA TGGAGATCTG CAAGCGTCGG GTGAATATGT ATGACACGGT GAACCAGAGC 1800 
AAAACCCCTT TCATOACGCA CGTGGCCCCC AGCACGTCCA CCAACCTGAC C 
AACAACCAGC TGACCACTGT CCACAACCAG GCTCCCTCCT CTACCAGTGC C 

TTAGCCAATC CCGAAGTCTC CATACTAAAC TACCCATCTA CACTCTACCA GCCCTCAGCG 1980 

GCATCCATGG CTGCAGTGGC CCAGCGGAGC ATGCCCCTGC AGACAGGAAC AGCCCAGATT 2040 

TGTGCCCGGC CTGACCCGTT CCAGCAAGCT CTCATCGTGT GTCCCCCCGG CTTCCAAGGC 2100 

TTGCAGGCCT CTCCCTCTAA GCACGCTGGC TACTCGGTGC GAATGGAAAA TGCAGTTCCC 2160 

ATCGTCACTC AAGCCCCAGG AGCTCAGCCT CTTCAGATCC AACCAGGTCT GCTTGCCCAG 2220 

CAGGCTTX3GC CAAGTGGGAC CCAGCAGATC CTGCTTCCCC CAGCATGGCA GCAACTGACT 2280 

GGAGTGGCCA CCCACACATC AGTGCAGCAT GCCACCGTGA TTCCCGAGAC CATGGCAGGC 2340 

ACCCAGCAGC TGGCGGACTG GAGAAATACG CATGCTCACG GAAGCCATTA TAATCCCATC 2400 

ATGCAGCAGC CTGCACTATT GACCGGTCAT GTGACCCTTC CAGCAGCACA GCCCTTAAAT 2460 

GTGGGTGTGG CCCACGTGAT GCGGCAGCAG CCAACCAGCA CCACCTCCTC CCGGAAGAGT 2520 

AAGCAGCACC AGTCATCTGT GAGAAATGTC TCCACCTGTG AGGTGTCCTC CTCTCAGGCC 2580 

ATCAGCTCCC CACAGCGATC CAAGCGTGTC AAGGAGAACA CACCTCCCCG CTGTGCCATG 2640 

GTGCACAGTA GCCCGGCCTG CAGCACCTCG GTCACCTGTG GGTGGGGCGA CGTGGCCTCC 2700 

AGCACCACCC GGGAACGGCA GCGGCAGACA ATTGTCATTC CCGACACTCC CAGCCCCACG 2760 

GTCAGCGTCA TCACCATCAG CAGTGACACG GACGAGGAGG AGGAACAGAA ACACGCCCCC 2820 

ACCAGCACTG TCTCCAAGCA AAGAAAAAAC GTCATCAGCT GTGTCACAGT CCACGACTCC 2880 

CCCTACTCCG ACTCCTCCAG CAACACCAGC CCCTACTCCG TGCAGCAGCG TGCTGGGCAC 2940 
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AACAATGCCA ATGCCTTTGA CACCAAGGGG AGCCTGGAGA ATCACTGCAC GGGGAACCCC 3000 
CGAACCATCA TCGTGCCACC CCTGAAAACC CAGGCCAGCG AAGTATTGGT GGAGTGTGAT 3 060 
AGCCTGGTGC CAGTCAACAC CAGTCACCAC TCGTCCTCCT ACAAGTCCAA GTCC1CCAGC 3120 



CAGCACATCA CCACGGACCG CACTGGGAGC CACCGAAGGC AGCAGGCCTA CATCACTCCC 3300 

ACCATGGCCC AGGCTCCGTA CTCCTTCCCG CACAACAGCC CCAGCCACGG CACTGTGCAC 3360 

CCGCATCTGG CTGCAGCCGC TGCCGCTGCC CACCTCCCCA CCCAGCCCCA CCTCTACACC 3420 

TACACTGCGC CGGCGGCCCT GGGCTCCACC GGCACCGTGG CCCACCTGGT GGCCTCGCAA 3480 

GGCTCTGCGC GCCACACCGT GCAGCACACT GCCTACCCAG CCAGCATCGT CCACCAGGTC 3 540 

CCCGTGAGCA TGGGCCCCCG GGTCCTGOCC TCGCCCACCA TCCACCCGAG TCAGTATCCA 3 600 

GCCCAATTTG CCCACCAGAC CTACATCAGC GCCTCGCCAG CCTCCACCGT CTACACTGGA 3 660 

TACCCACTGA GCCCCGCCAA GGTCAACCAG TACCCTTACA TATAAACACT GGAGGGGAGG 3720 

GAGGGAGGGA GGGAGGGAGA GAATGGCCCG AGGGAGGAGG GAGAGAAGGA GGGAGGCGCT 3780 

CCTGGGACCG TGGGCGCTGG CCTTTTATAC TGAAGATGCC GCACACAAAC AATGCAAACG 3 840 

GGGCAGGGGC GGGGGGGGGG GGGGCAGAGG GCAGGGGGAC GGGTCGGGAC ACCAGTGAAA 3900 

CTTGAACCGG GAAGTGGGAG GACGTAGAGC AGAGAAGAGA ACATTTTTAA AAGGAAGGGA 3960 
1TAAAGAGGG TGGGAAATCT ATGGTTTTTA TTTTAAAAAA 



NP.073577 

MAFVYEGMAS HVQVFSPHTL QSSAFCSVKK LKVEPSSNWD MTGYGSHSKV YSQSKNIPPS 60 
QPASTTVSTS LPVPNPSLPY EQTIVFPGST GHIVVTSASS TSVTGQVLGG PHNLMRRSTV 120 
SLLDTYQKCG LKRKSEEIEN TSS VQIIEEH PPMIQNNASG ATVATATTST ATSKNSGSNS 180 
EGDYQLVQHE VLCSMTNTYE VLEELGRGTF GQVVKCWKRG TNEIVAIKIL KNRPS YARQG 240 
OJEVSILARL STES ADDYNF VRAYECFQHK NHTCLVFEML EQNLYDFLKQ NKFSPLPLKY 300 
IRPVLQQVAT ALMKLKSLGL IHADLKPENI MLVDPSRQPY RVKVIDFGSA SHVSKAVCST 360 
YLQSRYYRAP EIILGLPFCE A1DMWSLGCV IAELFLGWPL YPGASEYDQI RY1SQTQGLP 420 
AEYLLSAGTK TTRFFNRDTD SPYPLWRLKT PDDHEAETGI KSKEARKYIF NCLDDMAQVN 480 
MTTDLEGSDM LVEKADRREF IDLLKKMLTI DADKRITPIE TLNHPFVTMT HLLDFPHSTH 540 
VKSCFQNMEI CKRRVNMYDT VNQSKTPF1T HVAPSTSTNL TMTFNNQLTT VHNQAPSSTS 600 
ATISLANPEV SILNYPSTLY QPSAASMAAV AQRSMPLQTG TAQICARPDP FQQALIVCPP 660 
GFQGLQASPS KHAGYSVRME NAVPIVTQAP GAQPLQIQPG LLAQQAWPSG TQQILLPPAW 720 
QQLTGVATHT SVQHATVIPE TMAGTQQLAD WRNTHAHGSH YNPIMQQPAL LTGHVTLPAA 780 
QPLNVGVAHV MRQQPTSTTS SRKSKQHQSS VRNVSTCEVS SSQAISSPQR SKRVKENTPP 840 
RCAMVHSSPA CSTSVTCGWG DVASSTTRER QRQUVIPDT PSPTVSVITI SSDTDEEEEQ 900 
KHAPTSTVSK QRKNVISCVT VHDSPYSDSS SNTSPYS VQQ RAGHNNANAF DTKGSLENHC 960 
TGNPRTITVP PLKTQASEVL VECDSLVPVN TSHHSSSYKS KSSSNVTSTS GHSSGSSSGA 1020 
1TYRQQRPGP HFQQQQPLNL SQAQQHITTD RTGSHRRQQA YITPTMAQAP YSFPHNSPSH 1080 
GTVHPHLAAA AAAAHLPTQP HLYTYTAPAA LGSTGTVAHL VASQGSARHT VQHTAYPASI 1 140 
VHQVPVSMGP RVLPSPTIHP SQYPAQFAHQ TYIS ASPAST VYTGYPLSPA KVNQYPYI 



1 11 21 31 41 51 

I I I I I I 

AGTCACAGCC AGGTAACCCT GGAGTGAAGC GGTTTAGTTA GAAGGGAGCA GATAAACTCG 
TCACTCTAGT AGCTTTAACC CTCACCCTGA GGCACCTTAG CAATCAGCCA TTGCCTGCAA 
GCCTCCAAAG CTTGTCTTTG CCTAATATGG AGCCCAAAGA AGCCACTGGG AAAGAAAACA 
TGGTCACCAA GAAAAAGAAT CTGGCCTTCT TGAGGTCTAG ACTCTATATG CTGGAGAGAA 
GGAAGACTGA CACTGTCGTT GAGAGCAGTG TTTCTGGGGA CCACTCTGGC ACCTTGAGGA 
GGAGCCAATC TGACAGGACC GAATACAACC AGAAATTACA AGAAAAGATG ACTCCACAGG 
GTGAGTGTTC TGTAGCTGAG ACCTTAACCC CAGAGGAAGA GCATCATATG AAGAGGATGA 
TGGCAAAGCG GGAAAAGATC ATTAAGGAGC TCATACAGAC AGAAAAGGAT TATCTCAATG 
ATCTAGAGCT GTGTGTTAGG GAAGTGGTTC AGCCCCTGAG AAATAAAAAG ACTGATAGGC 
TGGATGTGGA TAGCTTGTTT AGCAACATTG AGTCCGTGCA TCAGATATCA GCCAAGCTGC 
TGTCATTGTT GGAAGAGGCC ACAACAGACG TGGAACCGGC CATGCAAGTA ATTGGAGAAG 
TATTCTTGCA GATTAAAGGG CCACTGGAAG ATATTTATAA AATCTACTGC TATCACCATG 
ATGAAGCACA TAGTATACTG GAGTCCTATG AAAAGGAAGA AGAGCTGAAG GAACATTTGA 
GCCACTGTAT CCAGTCCTTA AAGTAAGGCC TTTTCAAATG ATGATTCCCA TCTCCTCTCA 
GTTGCCTAGC AGGGAACATT TTAAATGGAT GTAGATGAAA GGTCTCACAT AAATCCTATG 
TTTTATGAGA CTTGCTGGGA GCTCTGCTTT GCATTCCCTT TATAAAAAGC TGACATGCCA 
GAAGCCCTGA TTGACTTTTT TTCCCCCTGC GAGAA1GACT AAAAATAACA TGGAAGAAGA 
TTTAGAGCTC "reCAGCGATT GAAAAATGCA ATATCAAAAT ATAAAATGTG GAAGAAAAGC 
CTCTTCTTAA AGCTATTGTA ACTTGCCTGG CCCCACGTAG TTCAAGGATT ATGTGAGATA 
ACACGTGGCC CCATGACCAC TGGAGCACAT GGGTTAATGG AGTTAGGGGA ATGGCCTACA 
ACTCTGCATG GCCGTCTTCT TTCCCCAAAC TCACTGTGGG GAGATGGGTG AAGACAAGTC 
AGGCCTTGTT AAAGTTAGTT TCAGAACAAT TACTCATGCC TTCCTTTCTC ATCCCTAAAA 
CATTGGTGGG GGAGCTACAC AATGTACTTT TTCTTTTCTA GAGGAAGTAT CTATTCACTG 
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MEPKEATGKE MMVTKKKNLA FLRSRLYMLE RRKTDTWES SVSGDHSGTL RRSQSDRTEY 
NQKLQEKMTP QGECSVAETL TPEEEHHMKR MMAKREKIIK ELIQTEKDYL HDLELCVREV 
VQPLRNKKTD RLDVDSLFSN IESVHQISAK LLSLLEEATT DVEPAMQVIG EVFLQIKGPL 
EDIYKIYCYH HDEAHSILES YEKEEELKEH LSHCIQSLK 

SEQ ID NO:285 PBQ9 DNA SEQUENCE 
523-2676 (underlined sequence corresponds to start and stop codon) 



CCCTTATGGC GATTGGGCGG CTGCAGAGAC CAGGACTCAG TTCCCCTGCC CTAGTCTGAG 
CCTAGTGGGT GGGACTCAGC TCAGAGTCAG TTTTCAGAAG CAGGTTTCAG TTGCAGAGTT 
TTCCTACACT TTTCCTGCGC TAGAGCAGCG AGCAGCCTGG AACAGACCCA GGCGGAGGAC 
ACCTGTGGGG GAGGGAGCGC CTGGAGGAGC TTAGAGACCC CAGCCGGGCG TGATCTCACC 
ATGTGCGGAT TTGCGAGGCG CGCCCTGGAG CTGCTAGAGA TCCGGAAGCA CAGCCCCGAG 
GTGTGCGAAG CCACCAAGAC TGCGGCTCTT GGAGAAAGCG TGAGCAGGGG GCCACCGCGG 
TCTCCGGCCT GTCTGCACCC TGTCGCCTGA GCTGCCTGAC AGTGACAATG ACATCCCAGT 
TACCAGTGTC CTTGAATTGA TAGTGGCTTC TGTTTGTCAG TCTCATATAA GAACTACAGC 
TCATCAGGAG GAGATCGCAG CAGGGTAAGA GACACCAACA CCATGTTCTG CACGAAGCTC 
AAGGATCTCA AGATCACAGG AGAGTGTCCT TTCTCCTTAC TGGCACCAGG TCAAGTTCCT 
AACGAGTCTT CAGAGGAGGC AGCAGGAAGC TCAGAGAGCT GCAAAGCAAC CGTCCCCATC 
TGTCAAGACA TTCCTGAGAA GAACATACAA GAAAGTCTTC CTCAAAGAAA AACCAGTCGG 
AGCCGAGTCT ATCTTCACAC TTTGGCAGAG AGTATTTGCA AACTGATTTT CCCAGAGTTT 
GAACGGCTGA ATGTTGCACT TCAGAGAACA TTGGCAAAGC ACAAAATAAA A3AAAGCAGG 
AAATCTTTGG AAAGAGAAGA CTTTGAAAAA ACAATTGCAG AGCAAGCAGT GCAGCAGAGT 
CCAGTGGAGT TATCAAAGAA TCTCTTGGTG AAGAGGTTTT TAAAATATGT TACGAGGAAG 
ATGAAAACAT CCTTGGGGTG GTTGGAGGCA CCCTTAAAGA TTTTTAAACA GCTTCAGTAC 
CCTTCTGAAA CAGAGCAGCC ATTGCCAAGA AGCAGGAAAA AGGGGCAGCT TGAGGACGCC 
TCCATTCTAT GCCTGGATAA GGAGGATGAT TTTCTACATG TTTACTACTT CTTCCCTAAG 
AGAACCACCT CCCTGATTCT TCCCGGCATC ATAAAGGCAG CTGCTCACGT ATTATATGAA 
ACGGAAGTGG AAGTGTCGTT AATGCCTCCC TGCTTCCATA ATGATTGCAG CGAGTTTGTG 
AATCAGCCCT ACTTGTTGTA CTCCGTTCAC ATGAAAAGCA CCAAGCCATC CCTGTCCCCC 
AGCAAACCCC AGTCCTCGCT GGTGATTCCC ACATCGCTAT TCTGCAAGAC ATTTCCATTC 
CATTTCATGT TK3ACAAAGA TATGACAATT CTGCAATTTG GCAATGGCAT CAGAAGGCTG 
ATGAACAGGA GAGACTTTCA AGGAAAGCCT AATTTTGAAT ACTTTGAAAT TCTGACTCCA 
AAAATCAACC AGACCTTTAG CGGGATCATG ACTATGTTGA ATATGCAGTT TGTTGTACGA 
GTGAGGAGAT GGGACAACTC TGTGAAGAAA TCTTCAAGGG TTATGGACCT CAAAGGCCAA 
ATGATCTACA TTGTTGAATC CAGTGCAATC TTGTTTTTGG GGTCACCCTG TGTGGACAGA 
TTAGAAGATT TTACAGGACG AGGGCTCTAC CTCTCAGACA TCCCAATTCA CAATGCACTG 1740 



GGGAAGCTGA AGGCTACCCT TGAGCAAGCC CACCAAGCCC TGGAGGAGGA GAAGAAAAAG 1860 
G CTCAGCAGCT GTGGCAAGGG 
A GTTCAGTAAT GTCACCATGC TCTTCTCAGA CATCGTTGGG 

TTCACTGCCA TCTGCTCCCA GTGCTCACCG CTGCAGGTCA TCACCATGCT CAATGCACTG 2040 

TACACTCGCT TCGACCAGCA GTGTGGAGAG CTGGATGTCT ACAAGGTGGA GACCATTGCG 2100 

ATGCCTATTG TGTGGCTTGG GGGATTACAC AAAGAGAGTG ATACTCATGC TGTTCAGATA 2160 

GCGCTGATGG CCCTGAAGAT GATGGAGCTC TCTGATGAAG TTATGTCTCC CCATGGAGAA 2220 

CCTATCAAGA TGCGAATTGG ACTGCACTCT GGATCAGTTT TTGCTGGCGT CGTTGGAGTT 2280 

AAAATGCCCC GTTACTGTCT TTTTGGAAAC AATGTCACTC TGGCTAACAA ATTTGAGTCC 2340 

TGCAGTGTAC CACGAAAAAT CAATGTCAGC CCAACAACTT ACAGATTACT CAAAGACTGT 2400 

CCTGGTTTCG TGTTTACCCC TCGATCAAGG GAGGAACTTC CACCAAACTT CCCTAGTGAA 2460 

ATCCCCGGAA TCTGCCATTT TCTGGATGCT TACCAACAAG GAACAAACTC AAAACCATGC 2520 

TTCCAAAAGA AAGATGTGGA AGATGCAAGC CAATTTTTTA GGCAAAGCAT CAGGAATAGA 2580 

TTAGCAACCT ATATACCTAT TTATAAGTCT TTGGGGTTTG ACTCATTGAA GATGTGTAGA 2640 

GCCTCTGAAA GCACTTTAGG GATTGTAGAT GGCTAACAAG CAGTATTAAA ATTTCAGGAG 2700 

CCAAGTCACA ATCTTTCTCC TGTTTAACAT GACAAAATGT ACTCACTTCA GTACTTCAGC 2760 

TCTTCAAGAA AAAAAAAAAA ACCTTAAAAA GCTACTTTTG TGGGAGTATT TCTATTATAT 2820 

AACCAGCACT TACTACCTGT ACTCAAAATT CAGCACCTTG TACATATATC AGATAATTGT 2880 

AGTCAATTGT ACAAACTGAT GGAGTCACCT GCAATCTCAT ATCCTGGTGG AATGCCATGG 2940 

TTATTAAAGT GTGTTTGTGA TAGTTGTCGT CAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3000 

SEQ ID NO:286 PBQ9 Protein sequence: 
Protein Accession*: Q02108 

1 11 21 31 41 51 

I I I I I I 

MPCTKLKDLK ITGECPFSLL APGQVPNESS EEAAGSSESC KATVPICQDI PEKNIQESLP 60 

QRKTSRSRVY LHTLAESICK LIFPEFERRJ VALQRTLAKH KIKESRKSLE REDFEKTIAE 120 

QAVAAGVPVE VIKESLGEEV FKICYEEDEN ILGWGGTLK DFLNSFSTLL KQSSHCQEAG 180 

KRGRLEDA3I LCLDKEDDFL HVYYFFPKRT TSLILPGIIK AAAHVLYETE VEVSLHPPCF 240 

HHDCSEFVNQ PYLLYSVHMK STKPSLSPSK POSSLVIPTS LFCKTFPFHF MFDKDHTILQ 300 

FGNGIRRLMN RRDFQGKPNF EEYFEILTPK INQTFSGIMT HLNMQFWRV RRWDNSVKKS 360 

SRVMDLKGQM IYIVESSAIL FLGSPCVDRL EDFTGRGLYL SDIPIHNALR DWLIGEQAR 420 

AQDGLKKRLG KLKATLEQAH QALEEEKKKT VDLLCSIFPC EVAQQLWQGQ WQAKKFSNV 480 

TMLFSDWGF TAICSCCSPL QVITMLNALY TRFDQQCGEL CVYKVETIGD AYCVAGGLHK 540 
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3 SVFAGWGVK MPRYCLFGNN 



SEQ ID NO:287 PFD2 DNA SEQUENCE 

Nucleic Acid Accession*: NM_000720 

Coding sequence: 1 19-6564 (underlined sequence corresponds to start and stop codon) 

1 11 21 31 41 51 

I I I I I I 

AGAATAAGGG CAGGGACCGC GGCTCCTATC TCTTGGTGAT CCCCTTCCCC ATTCCGCCCC 
CGCCTCAACG CCCAGCACAG TGCCCTGCAC ACAGTAGTCG CTCAATAAAT GTTCGTGGAT 
GATGATGATG ATGATGATGA AAAAAATGCA GCATCAACGG CAGCAGCAAG CGGACCACGC 
GAACGAGGCA AACTATGCAA GAGGCACCAG ACTTCCTCTT TCTGGTGAAG GACCAACTTC 
TCAGCCGAAT AGCTCCAAGC AAACTGTCCT GTCTTGGCAA « 
k CTATGAGCAC CTCTGCACCC C 
A CGTCAGCAAT ACGCCAAGAG CAAAAAACAG GGTAACTCGT CCAACAGCCG 
ZU ACCTGCCCGC GCCCTTTTCT GTTTATCACT CAATAACCCC ATCCGAAGAG CCTGCATTAG 
TATAGTGGAA TGGAAACCAT TTGACATATT TATATTATTG GCTATTTTTG CCAATTGTGT 
GGCCTTAGCT ATTTACATCC CATTCCCTGA AGATGATTCT AATTCAACAA ATCATAACTT 
GGAAAAAGTA GAATATGCCT TCCTGATTAT TTTTACAGTC GAGACATTTT TGAAGATTAT 
AGCGTATGGA TTATTGCTAC ATCCTAATGC TTATGTTAGG AATGGATGGA ATTTACTGGA 
25 TTTTGTTATA GTAATAGTAG GATTGTTTAG TGTAATTTTG GAACAATTAA CCAAAGAAAC 
AGAAGGCGGG AACCACTCAA GCGGCAAATC TGGAGGCTTT GATGTCAAAG CCCTCCGTGC 
CTTTCGAGTG TTGCGACCAC TTCGACTAGT GTCAGGGGTG CCCAGTTTAC AAGTTGTCCT 
GAACTCCATT ATAAAAGCCA TGGTTCCCCT CCTTCACATA GCCCTTTTGG TATTATTTGT 
AATCATAATC TATCCTATTA TAGGATTGGA ACTTTTTATT GGAAAAATGC ACAAAACATG 
30 TTTTTTTGCT GACTCAGATA TCGTAGCTGA AGAGGACCCA GCTCCATGTG CGTTCTCAGG 
GAATGGACGC CAGTGTACTG CCAATGGCAC GGAATGTAGG AGTGGCTGGG TTGGCCCGAA 
CGGAGGCATC ACCAACTTTG ATAACTTTGC CTTTGCCATG CTTACTGTGT TTCAGTGCAT 
CACCATGGAG GGCTGGACAG ACGTGCTCTA CTGGGTAAAT GATGCGATAG GATGGGAATG 
GCCATGGGTG TATTTTGTTA GTCTGATCAT CCTTGGCTCA TTTTTCGTCC TTAACCTGGT 
35 TCTTGGTGTC CTTAGTGGAG AATTCTCAAA GGAAAGAGAG AAGGCAAAAG CACGGGGAGA 
TTTCCAGAAG CTCCGGGAGA AGCAGCAGCT GGAGGAGGAT CTAAAGGGCT ACTTGGATTG 
GATCACCCAA GCTGAGGACA TCGATCCGGA GAATGAGGAA GAAGGAGGAG AGGAAGGCAA 
ACGAAATACT AGCATGCCCA CCAGCGAGAC TGAGTCTGTG AACACAGAGA ACGTCAGCGG 
TGAAGGCGAG AACCGAGGCT GCTGTGGAAG T 
40 GGCCAAGGCG G 

™ ^ 1860 

ATATTTCGTC TCTCTTTTCA ACCGGTTTGA TTGCTTCGTG GTGTGTGGTG GAATCACTGA 19 B0 

GCGCCTCTTA AGAATCTTCA AAGTGACCAG GCACTGGACT TCCCTGAGCA ACTTAGTGGC 2100 

ATCCTTATTA AACTCCATGA AGTCCATCGC TTCGCTGTTG CTTCTGCTTT TTCTCTTCAT 2160 

TATCATCTTT TCCTTGCTTG GGATGCAGCT GTTTGGCGGC AAGTTTAATT TTGATGAAAC 2220 

GCAAACCAAG CGGAGCACCT TTGACAATTT CCCTCAAGCA CTTCTCACAG TGTTCCAGAT 2280 

CCTGACAGGC GAAGACTGGA ATGCTGTGAT GTACGATGGC ATCATGGCTT ACGGGGGCCC 2340 

ATCCTCTTCA GGAATGATCG TCTGCATCTA CTTCATCATC CTCTTCATTT GTGGTA&CTA 2400 

TATTCTACTG AATGTCTTCT TGGCCATCGC TGTAGACAAT TTGGCTGATG CTGAAAGTCT 2460 

GAACACTGCT CAGAAAGAAG AAGCGGAAGA AAAGGAGAGG AAAAAGATTG CCAGAAAAGA 2520 

GAGCCTAGAA AATAAAAAGA ACAACAAACC AGAAGTCAAC CAGATAGCCA ACAGTGACAA 25 B0 

CAAGGTTACA ATTGATGACT ATAGAGAAGA GGATGAAGAC AAGGACCCCT ATCCGCCTTG 2640 

CGATGTGCCA GTAGGGGAAG AGGAAGAGGA AGAGGAGGAG GATGAACCTG AGGTTCCTGC 2700 

CGGACCCCGT CCTCGAAGGA TCTCGGAGTT GAACATGAAG GAAAAAATTG CCCCCATCCC 2760 

TGAAGGGAGC GCTTTCTTCA TTCTTAGCAA GACCAACCCG ATCCGGGTAG GCTGCCACAA 2820 

GCTCATCAAC CACCACATCT TCACCAACCT CATCCTTGTC TTCATCATGC TGAGCAGCGC 2 8 BO 

TGCCCTGGOC GCAGAGGACC CCATCCGCAG CCACTCCTTC CGGAACACGA TACTGGGTTA 2940 

CTTTGACTAT GCCTTCACAG CCATCTTTAC TGTTGAGATC CTGTTGAAGA TGACAACTTT 3000 

TGGAGCTTTC CTCCACAAAG GGGCCTTCTG CAGGAACTAC TTCAATTTGC TGGATATGCT 3060 

GGTGGTTGGG GTGTCTCTGG TGTCATTTGG GATTCAATCC AGTGCCATCT CCGTTGTGAA 3120 

GATTCTGAGG GTCTTAAGGG TCCTGCGTCC CCTCAGGGCC ATCAACAGAG CAAAAGGACT 3180 

TAAGCACGTG GTCCAGTGCG TCTTCGTGGC CATCCGGACC ATCGGCAACA TCATGATCGT 3240 
CACTACCCTC C 

CTACAAGGAT GGGGATGTTG ACAGTCCTGT GGTCCGTGAA CGGATCTGGC AAAACAGTGA 3420 

TTTCAACTTC GACAACGTCC TCTCTGCTAT GATGGCGCTC TTCACAGTCT CCACGTTTGA 3480 

GGGCTGGCCT GCGTTGCTGT ATAAAGCCAT CGACTCGAAT GGAGAGAACA TCGGCCCAAT 3540 

CTACAACCAC CGCGTGGAGA TCTCCATCTT CTTCATCATC TACATCATCA TTGTAGCTTT 3600 

CTTCATGATG AACATCTTTG TGGGCTTTGT CATCGTTACA TTTCAGGAAC AAGGAGAAAA 3660 

AGAGTATAAG AACTGTGAGC TGGACAAAAA TCAGCGTCAG TGTGTTGAAT ACGCCTTGAA 3720 

AGCACGTCCC TTGCGGAGAT ACATCCCCAA AAACCCCTAC CAGTACAAGT TCTGGTACGT 3780 

GGTGAACTCT TCGCCTTTCG AATACATGAT GTTTGTCCTC ATCATGCTCA ACACACTCTG 3840 

CTTGGCCATG CAGCACTACG AGCAGTCCAA GATGTTCAAT GATGCCATGG ACATTCTGAA 3900 

CATGGTCTTC ACCGGGGTGT TCACCGTCGA GATGGTTTTG AAAGTCATCG CATTTAAGCC 3960 

TAAGGGGTAT TTTAGTGACG CCTGGAACAC GTTTGACTCC CTCATCGTAA TCGGCAGCAT 4020 

TATAGACGTG GCCCTCAGCG AAGCGGACCC AACTGAAAGT GAAAATGTCC CTGTCCCAAC 4080 
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TGCTACACCT GGGAACTCTG AAGAGAGCAA TAGAATCTCC ATCACCTTTT TCCGTCTTTT 4140 

CCGAGTGATG CGATTGGTGA AGCTTCTCAG CAGGGGGGAA GGCATCCGGA CATTGCTGTG 4200 

GACTTTTATT AAGTCCTTTC AGGCGCTCCC GTATGTGGCC CTCCTCATAG CCATGCTGTT 4260 

CTTCATCTAT GCGGTCATTG GCATGCAGAT GTTTGGGAAA GTTGCCATGA GAGATAACAA 4320 

5 CCAGATCAAT AGGAACAATA ACTTCCAGAC GTTTCCCCAG GCGGTGCTGC TGCTCTTCAG 4380 

GTGTGCAACA GGTGAGGCCT GGCAGGAGAT CATGCTGGCC TGTCTCCCAG GGAAGCTCTG 4440 

TGACCCTGAG TCAGATTACA ACCCCGGGGA GGAGTATACA TGTGGGAGCA ACTTTGCCAT 4500 

TGTCTATTTC ATCAGTTTTT ACATGCTCTG TGCATTTCTG ATCATCAATC TGTTTGTGGC 4560 

TGTCATCATG GATAATTTCG ACTATCTGAC CCGGGACTGG TCTATTTTGG GGCCTCACCA 4620 

10 TTTAGATGAA TTCAAAAGAA TATGGTCAGA ATATGACCCT GAGGCAAAGG GAAGGATAAA 4680 

ACACCTTGAT GTGGTCACTC TGCTTCGACG CATCCAGCCT CCCCTGGGGT TTGGGAAGTT 4740 

ATGTCCACAC AGGGTAGCGT GCAAGAGATT AGTTGCCATG AACATGCCTC TCAACAGTGA 4800 

CGGGACAGTC ATGTTTAATG CAACCCTGTT TGCTTTGGTT CGAACGGCTC TTAAGATCAA 4860 

GACCGAAGGG AACCTGGAGC AAGCTAATGA AGAACTTCGG GCTGTGATAA AGAAAATTTG 4920 

15 GAAGAAAACC AGCATGAAAT TACTTGACCA AGTTGTCCCT CCAGCTGGTG ATGATGAGGT 4980 

AACCGTGGGG AAGTTCTATG CCACTTTCCT GATACAGGAC TACTTTAGGA AATTCAAGAA 5040 

ACGGAAAGAA CAAGGACTGG TGGGAAAGTA CCCTGCGAAG AACACCACAA TTGCCCTACA 5100 

GGCGGGATTA AGGACACTGC ATGACATTGG GCCAGAAATC CGGCGTGCTA TATCGTGTGA 5160 

TTTGCAAGAT GACGAGCCTG AGGAAACAAA ACGAGAAGAA GAAGATGATG TGTTCAAAAG 5220 

20 AAATGGTGCC CTGCTTGGAA ACCATGTCAA TCATGTTAAT AGTGATAGGA GAGATTCCCT 5280 

TCAGCAGACC AATACCACCC ACCGTCCCCT GCATGTCCAA AGGCCTTCAA TTCCACCTGC 5340 

AAGTGATACT GAGAAACCGC TGTTTCCTCC AGCAGGAAAT TCGGTGTGTC ATAACCATCA 5400 

TAACCATAAT TCCATAGGAA AGCAAGTTCC CACCTCAACA AATGCCAATC TCAATAATGC 5460 

CAATATGTCC AAAGCTGCCC ATGGAAAGCG GCCCAGCATT GGGAACCTTG AGCATGTGTC 5520 

25 TGAAAATGGG CATCATTCTT CCCACAAGCA TGACCGGGAG CCTCAGAGAA GGTCCAGTGT 5580 

GAAAAGAACC CGCTATTATG AAACTTACAT TAGGTCCGAC TCAGGAGATG AACAGCICCC 5640 

AACTATTTGC CGGGAAGACC CAGAGATACA TGGCTATTTC AGGGACCCCC ACTGCTTGGG 5700 

GGAGCAGGAG TATTTCAGTA GTGAGGAATG CTACGAGGAT GACAG CTCGC CCACCTGGAG 5760 

CAGGCAAAAC TATGGCTACT ACAGCAGATA CCCAGGCAGA AACATCGACT CTGAGAGGCC 5820 

30 CCGAGGCTAC CATCATCCCC AAGGATTCTT GGAGGACGAT GACTCGCCCG TTTGCTATGA 5880 

TTCACGGAGA TCTCCAAGGA GACGCCTACT ACCTCCCACC CCAGCATCCC ACCGGAGATC 5940 

CTCCTTCAAC TTTGAGTGCC TGCGCCGGCA GAGCAGCCAG GAAGAGGTCC CGTCGTCTCC 6000 

CATCTTCCCC CATCGCACGG CCCTGCCTCT GCATCTAATG CAGCAACAGA TCATGGCAGT 6060 

TGCCGGCCTA GATTCAAGTA AAGCCCAGAA GTACTCACCG AGTCACTCGA CCCGGTCGTG 6120 

35 GGCCACCCCT CCAGCAACCC CTCCCTACCG GGACTGGACA COGTGCTACA CCCCCCTGAT 6180 

CCAAGTGGAG CAGTCAGAGG CCCTGGACCA GGTGAACGGC AGCCTGCCGT CCCTGCACCG 6240 

CAGCTCCTGG TACACAGACG AGCCCGACAT CTCCTACCGG ACTTTCACAC CAGCCAGCCT 6300 

GACTGTCCCC AGCAGCTTCC GGAACAAAAA CAGCGACAAG CAGAGGAGTG CGGACAGCTT 6360 

GGTGGAGGCA GICCTGATAT CCGAAGGCTT GGGACGCTAT GCAAGGGACC CAAAATTTGT 6420 

40 GTCAGCAACA AAACACGAAA TCGCTGATGC CTGTGACCTC ACCATCGACG AGATGGAGAG 6480 

TGCAGCCAGC ACCCTGCTTA ATGGGAACGT GCGTCCCCGA GCCAACGGGG ATGTGGGCCC 6540 

CCTCTCACAC CGGCAGGACT ATGAGCTACA GGACTTTGGT CCTGGCTACA GCGACGAAGA 6600 

GCCAGACCCT GGGAGGGATG AGGAGGACCT GGCGGATGAA ATGATATGCA TCACCACCTT 6660 

GTAG CCCCCA GCGAGGGGCA GACTGGCTCT GGCCTCAGGT GGGGCGCAGG AGAGCCAGGG 6720 

45 GAAAAGTGCC TCATAGTTAG GAAAGTTTAG GCACTAGTTG GGAGTAATAT TCAATTAATT 6780 

AGACTTTTGT ATAAGAGATG TCATGCCTCA AGAAAGCCAT AAACC TGGTA GGAACAGGTC 6840 

CCAAGCGGTT GAGCCTGGCA GAGTACCATG CGCTCGGCCC CAGCTGCAGG AAACAGCAGG 6900 

CCCCGCCCTC TCACAGAGGA TGGGTGAGGA GGCCAGACCT GCCCTGCCCC ATTGTCCAGA 6960 

TGGGCACTGC TGTGGAGTCT GCTTCTCCCA TGTACCAGGG CACOAGGCCC ACCCAACTGA 7020 

50 AGGCATGGCG GCGGGGTGCA GGGGAAAGTT AAAGGTGATG ACGATCATCA CACCTCGTGT 7080 

CGTTACCTCA GCCATCGGTC TAGCATATCA GTCACTGGGC CCAACATATC CATTTTTAAA 7140 
CCCTTTCCCC CAAATACACT GCGTCCTGGT TCCTGTTTAG CTGTTCTGAA ATA 

SEQ ID NO:2B8 PFD2 Protein sequence: 
55 Protein Accession* A381S8 



1 11 21 31 41 51 

I I I I I I 

MMMMMMMKKM QHQRQQQADH ANEANYARGT RLPLSGEGPT SQPNSSKQTV LSWQAAIDAA 60 

60 RQAKAAQTMS TSAPPPVGSL SQRKRQQYAK SKKQGNSSNS RPARALFCLS LNNPIRRACI 120 

SIVEWKPFDI FILLAIFANC VALAIYIPFP EDDSNSTNHN LEKVEYAFLI IFTVETFLKI 180 

IAYGLLLHPM AYVRNGWNLL DFVIVIVGLF SVILEQLTKE TEGGNHSSGK SGGFDVKALR 240 

AFRVLRPLRL VSGVPSLQW LNSIIKAMVP LLHIALLVLF VIIIYAIIGL ELFIGKMHKT 300 

CFFADSD1VA BEDPAPCAFS GNGRQCTANG TECRSGWVGP NGGITNFDNF AFAMLTVFQC 360 

65 ITMEGWTDVL YWVMDAIGWE WPWVYFVSLI ILGSFFVLNL VLGVLSGEFS KEREKAKARG 420 

DFQKLREKQQ LEEDLKGYLD WITQAEDIDP ENEEEGGEEG KRNTSMPTSE TESVNTENVS 480 

GEGENRGCCG SLWCWWRRRG AAKAGPSGCR RWGQAISKSK LSRRWRRWNR FNRRRCRAAV 540 

KSVTFYWLVI VLVFLNTLTI SSEHYNQPDW LTQIQDIANK VLLALFTCEK LVKHYSLGLQ 600 

AYFVSLFNRF DCFWCGGIT ETILVELEIM SPLGISWRC VRLLRIFKVT RHWTSLSNLV 660 

70 ASLLNSMKSI ASLLLLLFLF IIIFSLLGMQ LFGGKFNFDE TQTKRSTFDN FPQALLTVFQ 720 

I LTGEDWNAV HYDGIMAYGG PSSSGMIVCI YFIILFICGN YILLNVFLAI AVDNLADAES 780 

LNTAQKEEAE EKERKKIARK ESLENXKNNK PEVNQIANSD NKVTIDDYRE EDEDKDPYPP 840 

CDVPVGEEEE EEEEDEPEVP AGPRPRRISE LNMKEKIAPI PEGSAFFILS KTNPIRVGCH 900 

KLINHHIFTW LILVFIMLSS AALAAEDPIR SHSFRNTILG YFDYAFTAIF TVEILLKMTT 960 

75 FGAFLKKGAF CRNYFNLLDM LWGVSLVSF GIQSSAISW KILRVLRVLR PLRAINRAKG 1020 

LKHWQCVFV AIRTTGNIMI VTTLLQFMFA CIGVQLFKGK FYRCTDEAKS NPEECRGLFI 1080 

LYKDGDVDSP WRERIWQNS DFNFDNVLSA MMALFTVSTF EGWPALLYKA IDSNGENIGP 1140 

IYNHRVEISI FFIIYIIIVA FFMMNIFVGF VIVTFQEQGE KEYKNCELDK NQRQCVEYAL 1200 

KARPLRRY1P KNPYQYKFWY WNSSPFEYM MFVLIMLNTL CIjAMQHYEQS KMFNDAMDIL 1260 

80 NMVFTGVFTV EKVLKVIAFK PKGYFSDAWN TFDSLIVIGS IIDVALSEAD PTBSENVPVP 1320 
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TATPGNSEES NRISITFFRL FRVMRLVKLL SRGEGIRTLL WTFIKSFQAL PYVALLIAML 13 BO 

FFIYAVIGMQ HFGKVAMRDN NQINRNNNFQ TFFQAVLLLF RCATGEAVJQE IMLACLPGKL 1440 

CDPESDYNPG EEYTCGSMFA IVYFISFYML CAFLIINLFV AVIMDNFDYL TRDWSILGPH 15Q0 

HLDEFKRIWS EYDPEAKGRI KHLDWTLLR RIQPPLGPGK LCPHRVACKR LVAMNMPLHS 1560 

DGTVMFNATL FALVRTALKI KTEGNLEQAN EELRAVIKKI WKKTSMKLLD QWPPAGDDE 1620 

VTVGKFYATF LIQDYFRKFK KRKEQGLVGK YPAKNTTIAL QAGLRTLHDI GPEIRRAISO 16SO 

DLQDDEPEET KREEEDDVFK RNGALLGNHV NHVNSDRRDS LQQTNTTHRP LHVQRPSIPP 1740 

ASDTEKPLFP PAGNSVCHNH HNHNSIGKQV PTSTNANLNN ANMSKAAHGK RPSIGNLEHV 1800 

SENGHHSSHK HDREPQRRSS VKRTRYYETY IRSDSGDEQL PTICREDPEI HGYFRDPHCL 1860 

GEQEYFSSEE CYEDDSSPTW SRQNYGYYSR YPGRNIDSER PRGYHHPQGF LEDDDSFVCY 1920 

DSRKSPRRRL LPPTPASHRR SSFNFECLRR QSSQEEVPSS PIFPHRTALP LHLMQQQIMA 1980 

VAGLDSSKAQ KYSPSHSTRS WATPPATPPY RDWTPCYTPL 1QVEQSEALD QVNGSLPSLH 2040 

RSSWYTDEPD IEYRTFTPAS LTVPSSFRNK NSDKQRSADS LVEAVLISEG LGRYARDPKF 2100 

VSATKHEIAD ACDLTIDEME SAASTLLNGN VRPRANGDVG PLSHRQDYEL QDFGPGYSDE 2160 
EPDPGRDEED LADEMICITT L 

SEQ ID HO:289 0BI6 DNA SEQUENCE 

Nucleic Acid Accession*: NMJW2812 



1 11 21 31 41 

I I I I I 

AACTCCCGCC TCGGGACGCC TCGGGGTCGG G 

GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCGCGGAGCA GTCTGCGGCC CGCCGTGCGC 120 

CCTCAGCTCC TTTTCCTGAG CCCGCCGC GA TG GGAGCTGC GCGGGGATCC CCGGCCAGAC 180 

CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGGCGGT ACCCAGACAG 240 

CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 

CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 420 

ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATCTC ACTGGAGAAG 480 

AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGGCAGGT CCTGTGGTCC 540 

TGAAGCATCC AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 600 

ACATOGATGG GCACCCTCGG CCCAOCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 

ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CCTGACGCTC CGGCCAGCTG 720 

GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 

CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 900 

AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTG 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 

TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 

GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200 

AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 

AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320 

GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 

TCCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 

TGGATTCCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 

TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560 

TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATCG 1620 

AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 1740 

AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 1S00 

ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 1860 

TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GTCCCCATGT CCAGCTCACT GTGGCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 2040 

TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 2100 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCCGTGCTCG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 

AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

H AACGCCACAG CACAAGTGAT AAGATGCACT 2520 

3 CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

A GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

2700 
2760 
2820 

3 AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880 

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATCATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 
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CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGAGTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

C TTTTGAC AC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 
TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 



1 11 21 31 41 51 

I I I I I I 

HGAARGSPAR PRRLPLLSVL LLPLLGGTQT AIVFIKQPSS QDALQGRRAL LRCEVEAPGP 
VHVYWLLDGA PVQDTERRFA OGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARSANASFN 
IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 
KERNLTLR PA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARWLA PQDVWARYE 
EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 
CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 
VRLPTHGRVY OKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 
SQLEEGKPGY LDCLTQATPK PTWWYRNQM LIEEDSRFEV FKNGTLRINS VEVYDGTWYR 
CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 
GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPOGQI RAHVQLTVAV FITFKVEPER 
TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 
RYTCIAGNSC NIKHTEAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYHAVL 
GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SLGSGPAATN 
KRHSTSDKHH FPRSSLOPIT TLGKSEFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 
LDFRRELEHF GKLHHANWR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 
PLSTKQKVAL CTQVALGMEH LSMNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 
YHFRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEMPHGGQ ADDEVLADLQ 1 
AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSKP 

SEQ ID NO;291 AAB1 DNA SEQUENCE 

Nucleic Add Accession*: NM.002205 



ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60 

CGCCGACCCC CGCTSSTCCC GCTGCTCTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

GGA1TCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTOCT 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420 

TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540 

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600 

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320 

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTOT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620 

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800 

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 
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GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTGAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCC TATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTCCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTCCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940 

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCATCCT GTTTCGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA 



Protein Accession ft 



MGSRTPESPL HAVQLRWG PR RRPPLLPLLL LLLPPPPRVG GPNLDAEAPA VLSGPPGSFF 
GFSVEFYRPG TDGVSVLVGA PKAKTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 
G EEPVEYKSLQ WFGATVRAHG SSILACAPLY S' 

E QMASYFGYAV 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YFDLIVGSFG 480 

VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 

GFTVELQLDW QKOKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

EiSPIHIAtiNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQHHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTASPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRIiLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALSGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVCWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 

SEQ ID NO:293 IBH4 DNA SEQUENCE 



1 11 21 31 41 51 
I I I I I I 

GGGGGCGCCG CGCGCTGACC CTCCCTGGGC ACCGCTGGGG ACGATGGCGC TGCTCGCCTT 60 
GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 120 
AGATCCAGAG GACTCCCAGC GAACGGACGA GGGTGACAAT AGAGTGTGGT GTCATGTTTG 180 
TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG TGCAAATGGA CAGAGCCATA 240 
CTGCGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC ATGGTTGCGA AGCAGTGCTC 300 
CGCTGGTTGT GCAGCGATGG AGAGACCCAA GCCAGAGGAG AAGCGGTTTC TCCTGGAAGA 360 
GCCCATGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC TACTGCAATT TAGAGGGGCC 420 
ACCTATCAAC TCATCAGTGT TCAAAGA ATA TGCTGGGAGC ATGGGTGAGA GCTGTGGTGG 480 
GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA GCCGGCCTCA GCCTGTCTTG. 540 
AGCCACGGGA CTGCCACAGA CTGAGCCTTC CGGAGCATGG ACTCGCTCCA GACCGTTGTC 600 
ACCTGTTGCA TTAAACTTGT TTTCTGTTGA TTACCTCTTG GTTTGACTTC CCAGGGTCTT 660 
GGGATGGGAG AGTGGGGATC AGGTGCAGTT GGCTCTTAAC CCTCAAGGGT TCTTTAACTC 720 
ACATTCAGAG GAAGTCCAGA TCTCCTGAGT AGTGATTTTG GTGACAAGTT TTTCTCTTTG 780 
AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC TTTTCCTTGA CTCCCCTCTG 840 
CCTCTGAGGG CTTCAGTATT GATGGGGAGG GAGGCCTAAG TACCACTCAT GGAGAGTATG 900 
TGCTG AGATG CTTCCGACCT TTCAGGTGAC GCAGG AACAC TGGGGGAGTC TGAATG ATTG 960 
GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCAGCATGG GGGGCAGTGG GGCACACGTT 1020 
AGGGCTGCCC CCATTCCAGT GGTGGAGGCG CTGTGGATGG CTGCTTTTCC TCAACCTTTC 1080 
CTACCAGATT CCAGG AGGCA GAAGATAACT AATTGTGTTG AAGAAACTTA GACTTCACCC 1 140 
ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC GTGTGTGTTC AACATCTGAA 1200 
ACTTAGGCCA AGTAGAGAGC ATCAGGGTAA ATGGCGTTCA TTTCTCTGTT AAGATGCAGC 1260 
CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC AAAAACAAAT ACAAGGGGAC 1320 
TTCAAAAGTT CACGA AAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 
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SEQlPW;S4tfH4Pr?WnCTWs 
Protein Accession *: AAH01291 



5 1 11 21 31 41 51 

MALLALLLVV ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TFECQNPRRC 60 
KWTEPYCVIA AVKIFPRFFM VAKQCSAGCA AMERPKPEEK RFLLEEPMPF FYLKCCKIRY 120 
CNLEGPPINS SVFKEYAGSM GESCGGLWLA ILLLLASIAA GLSLS 



15 

It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
20 application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 1 . A method of detecting a prostate cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-16. 

1 2. The method of claim 1, wherein the polynucleotide selectively 

2 hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1-16. 

1 3. The method of claim 1, wherein the biological sample is a tissue 

2 sample. 

1 4. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 5 . The method of claim 4, wherein the nucleic acids are mRNA. 

1 6. The method of claim 4, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 7. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-16. 

1 8 . The method of claim 1 , wherein the polynucleotide is labeled. 

1 9. The method of claim 8, wherein the label is a fluorescent label. 

1 10. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 11. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat prostate cancer. 

1 12. The method of claim 1, wherein the patient is suspected of having 

2 prostate cancer. 
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1 13. A method of monitoring the efficacy of a therapeutic treatment of 

2 prostate cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 

5 (ii) determining the level of a prostate cancer-associated transcript in the 

6 biological sample by contacting the biological sample with a polynucleotide that selectively 

7 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-16, 

8 thereby monitoring the efficacy of the therapy. 

1 14. The method of claim 13, further comprising the step of: (iii) comparing 

2 the level of the prostate cancer-associated transcript to a level of the prostate cancer- 

3 associated transcript in a biological sample from the patient prior to, or earlier in, the 

4 therapeutic treatment. 

1 15. The method of claim 13, wherein the patient is a human. 

1 16. A method of monitoring the efficacy of a therapeutic treatment of 

2 prostate cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 

5 (ii) determining the level of a prostate cancer-associated antibody in the 

6 biological sample by contacting the biological sample with a polypeptide encoded by a 

7 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

8 as shown in Tables 1-16, wherein the polypeptide specifically binds to the prostate cancer- 

9 associated antibody, thereby monitoring the efficacy of the therapy. 

1 17. The method of claim 16, further comprising the step of: (iii) comparing 

2 the level of the prostate cancer-associated antibody to a level of the prostate cancer- 

3 associated antibody in a biological sample from the patient prior to, or earlier in, the 

4 therapeutic treatment. 

1 18. The method of claim 16, wherein the patient is a human. 
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1 19. A method of monitoring the efficacy of a therapeutic treatment of 

2 prostate cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 

5 (ii) determining the level of a prostate cancer-associated polypeptide in the 

6 biological sample by contacting the biological sample with an antibody, wherein the antibody 

7 specifically binds to a polypeptide encoded by a polynucleotide that selectively hybridizes to 

8 a sequence at least 80% identical to a sequence as shown in Tables 1-16, thereby monitoring 

9 the efficacy of the therapy. 

1 20. The method of claim 19, further comprising the step of: (iii) comparing 

2 the level of the prostate cancer-associated polypeptide to a level of the prostate cancer- 

3 associated polypeptide in a biological sample from the patient prior to, or earlier in, the 

4 therapeutic treatment. 

1 21 . The method of claim 19, wherein the patient is a human. 

1 22. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1-16. 

1 23 . The nucleic acid molecule of claim 22, which is labeled. 

1 24. The nucleic acid of claim 23, wherein the label is a fluorescent label 

1 25. An expression vector comprising the nucleic acid of claim 22. 

1 26. A host cell comprising the expression vector of claim 25. 

1 27. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-16. 

1 28 . An antibody that specifically binds a polypeptide of claim 27. 

1 29. The antibody of claim 28, further conjugated to an effector component. 
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1 30. The antibody of claim 29, wherein the effector component is a 

2 fluorescent label. 

1 31. The antibody of claim 29, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 32. The antibody of claim 29, which is an antibody fragment. 

1 33. The antibody of claim 29, which is a humanized antibody 

1 34. A method of detecting a prostate cancer cell in a biological sample 

2 from a patient, the method comprising contacting the biological sample with an antibody of 

3 claim 28. 

1 35. The method of claim 34, wherein the antibody is further conjugated to 

2 an effector component. 

1 36. The method of claim 35, wherein the effector component is a 

2 fluorescent label. 

1 37. A method of detecting antibodies specific to prostate cancer in a 

2 patient, the method comprising contacting a biological sample from the patient with a 

3 polypeptide encoded by a nucleic acid comprises a sequence from Tables 1-16. 

1 38. A method for identifying a compound that modulates a prostate cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a prostate cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-16; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 

1 39. The method of claim 3 8, wherein the functional effect is a physical 

2 effect. 
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1 40. The method of claim 38, wherein the functional effect is a chemical 

2 effect. 

1 41. The method of claim 38, wherein the polypeptide is expressed in a 

2 eukaryotic host cell or cell membrane. 

1 42. The method of claim 38, wherein the functional effect is determined by 

2 measuring ligand binding to the polypeptide. 

1 43. The method of claim 38, wherein the polypeptide is recombinant. 

1 44. A method of inhibiting proliferation of a prostate cancer-associated 

2 cell to treat prostate cancer in a patient, the method comprising the step of administering to 

3 the subject a therapeutically effective amount of a compound identified using the method of 

4 claim 38. 

1 45. The method of claim 44, wherein the compound is an antibody. 

1 46. The method of claim 45, wherein the patient is a human. 

1 47. A drug screening assay comprising the steps of 

2 (i) administering a test compound to a mammal having prostate cancer or a 

3 cell isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-16 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of prostate cancer. 

1 48. The assay of claim 47, wherein the control is a mammal with prostate 

2 cancer or a cell therefrom that has not been treated with the test compound. 

1 49. The assay of claim 47, wherein the control is a normal cell or mammal. 
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1 50. A method for treating a mammal having prostate cancer comprising 

2 administering a compound identified by the assay of claim 47. 

1 5 1 . A pharmaceutical composition for treating a mammal having prostate 

2 cancer, the composition comprising a compound identified by the assay of claim 47 and a 

3 physiologically acceptable excipient. 

1 52. The method according to claim 1, wherein said biological sample is 

2 contacted with a plurality of polynucleotides comprising a first polynucleotide that 

3 selectively hybridizes to a sequence at least 80% identical to a first sequence as shown in 

4 Tables 1-16; and a second polynucleotide that selectively hybridizes to a second sequence at 

5 least 80% identical to a second sequence as shown in Tables 1-16. 

1 53. A method according to claim 52, wherein the plurality of 

2 polynucleotides comprises a third polynucleotide that selectively hybridizes to a sequence at 

3 least 80% identical to a third sequence as shown in Tables 1-16.. 

1 54. A method of detecting a prostate cancer associated transcript, the 

2 method comprising contacting a biological sample from the patient with a plurality of 

3 polynucleotides wherein at least two of said polynucleotides selectively hybridize to a 

4 difference sequence at least 80% identical to a sequence as shown in Tables 1-16. 

1 55. A method of detecting a prostate cancer, the method comprising the 

2 steps of: 

3 (i) providing a biological sample from a patient; 

4 (ii) contacting the biological sample with a first polynucleotide that selectively 



5 hybridizes to a sequence at least 80% identical to a first sequence as shown in Tables 1-16 to 

6 determine the level of a prostate cancer-associated transcript in the biological sample; and 

7 with a second polynucleotide that selectively hybridizes to a second sequence at least 80% 

8 identical to a sequence not shown in Tables 1-16; wherein the expression of said second 

9 sequence is not substantially changed in prostate cancer, to detemine the level of expression 
10 of a control transcript in the biological sample; 
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1 1 (iii) comparing the level of the prostate cancer-associated transcript to a level 

12 of the normal tissue associated transcript in the biological sample. 

1 56. A method of quantitating a prostate cancer-associated transcript in a 

2 cell from a patient, the method comprising contacting a biological sample from the patient 

3 with a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a 

4 sequence as shown in Tables 1-16. 

1 57. The method of claim 56, wherein the polynucleotide selectively 

2 hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1-16. 

1 58. The method of claim 56, wherein the biological sample is a tissue 

2 sample. 

1 59. The method of claim 56, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 60. The method of claim 56, wherein the nucleic acids are mRNA. 

1 61. The method of claim 59, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 62. The method of claim 56, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-16. 

1 63. The method of claim 56, wherein the polynucleotide is labeled. 

1 64. The method of claim 63, wherein the label is a fluorescent label. 

1 65. The method of claim 56, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 66. The method of claim 56, wherein the patient is undergoing a 

2 therapeutic regimen to treat metastatic prostate cancer. 

1 67. The method of claim 56, wherein the patient is suspected of having 

2 metastatic prostate cancer. 
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1 68. A biochip comprising a plurality of polynucleotides that selectively 

2 hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-16. 

1 69. A method of screening drug candidates comprising: 

2 i) providing a cell that expresses an expression profile gene selected from the 

3 group consisting of an expression profile gene set forth in Tables 1-16 or fragment thereof; 

4 ii) adding a drug candidate to said cell; and 

5 iii) determining the effect of said drug candidate on the expression of said 

6 expression profile gene. 

1 70. A method according to claim 59 wherein said determining comprises 

2 comparing the level of expression in the absence of said drug candidate to the level of 

3 expression in the presence of said drug candidate. 

1 SF 1277890 vl 
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